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STATISTICS AND FOREIGN POLICY* 

Willard L. Thorp 
Aaaielant Secretary of Slate 

T O THE CABTOomsT, those who operate in the field of foreign policy 
fall into one of two categories. First arc the black-haired, gaunt 
and cunning conspirators whose complicated and devious scheming is 
for objectives intended never to be apparent by methods which are 
sinister and super-top-secret. The other category includes the tea¬ 
drinking, spat-wearing butterflies who are too vacuous to have an ob¬ 
jective and whose method is limited to charm, with the “r” relatively 
silent. Unfortunately, neither of these pen-and-ink conceptions even 
hints at the backbone of present-day foreign policy making and imple¬ 
mentation, namely, careful theoretical and factual analysis by skilled 
technicians. 

An increasing number of major problems of foreign policy are of 
such a character that measurement and magnitude become elements of 
basic importance. It is worthy of note that recently, when a delegation 
from the United Kingdom arrived to discuss with us a revision of the 
Bizonal Fusion Agreement for Germany, included in its impedimenta 
was a calculating machine. The classic “S” commodity list, beginning 
“shoes and ships,” has been out of date for some time due to the 
lamentable obsolescence of that colorful item known as scaling wax, 
but the now “F” list—^food, fuel, fibres, and fertilizer—^has top priority 
on the current foreign policy agenda of most governments and these 
items appear there almost entirely as problems of magnitude. Today 
calories get more attention than kings. The slide rule, the calculating 
machine, and the statistical yearbook have become necessary tools of 
diplomacy. 

The statistical puzzles anting out of foreign policy problems nat¬ 
urally are as varied as the universe. They are not limited by subject- 
matter, area, or time. What should be the number of whales permitted 

* Pr«sideniial address delivered at the 107th Armiial Meeting of the American Statistical Associa¬ 
tion on December 20» 1947. 
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Tinder the International Whaling Convention to be caught during the 
next whaling season in order to maintain a stable whale population, 
keeping also in mind the world shortage of fats and oils—a neat prob¬ 
lem in the vital statistics and caloric content of whales? What arc the 
proper cost-of-living-adjustment allowances to permit U. S. Govern¬ 
ment representatives in various countries to maintain approximately 
equivalent living standards—an index number problem with peculiar 
difficulties not merely for price level and foreign exchange reasons, but 
because of differing national customs of hospitality and patterns of 
protocol? What was the probable amovmt of destruction to American 
property in Italy arising out the War, and for which Italy must make 
partial payment under the Peace Treaty—a problem of statistical in¬ 
ference from quite fortuitous and incomplete sample data? What is the 
fair proportion between the United States and India for the shipment 
of raw cotton to Japan imder present controlled trade conditions—a 
problem in which the accepted guiding principle requires the finding of 
a “representative base period” out of a most abnormal series of years? 
What is the relationship between the availability of tobacco products 
in the Ruhr and the production of coal and steel—a problem of psycho- 
lo^cal measurement since the proposal as made by several Senators 
rests in the allegation that tobacco products are particularly effective 
as incentive goods? What amount of goods sent to Russia imder the 
Lend-Lease program was presumably imused and undestroyed at the 
end of the War and thus subject to a negotiated settl^ent—^a problem 
in war-time and peace-time property life-tables, and attrition and de¬ 
preciation rates? This random list of a few problems may serve to es¬ 
tablish the inference of the presence of statistics in the State Depart¬ 
ment, but the record will be clearer if we consider two illustrations in 
somewhat fuller detail. 

In a world where there are desperate Portages of commodities, the 
problem of allocation has become a matter Of prime importance. 
Countries have become competing purchasers, and even, in a few 
tra^c cases, competitors for reli^ assistance. The shortages arc widc- 
^read and severe, and for many commodities there are few countries 
with an exportable surplus. Foo^uffs are in this category and no gov- 
emmart with any claim to responability can look away while its people 
are himgry. The State Department has probably reedved more aide- 
memoires, notes, memoranda, and formal and informal visitationB 
from Prime Ministers, Ambassadors, foreign technicians, and even self- 
appointed representatives, concerning the subject of food allocations 
than any other angle topic during the last two years. The White House 
too has had dislm^uidied callers on the some subject. 
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The development and application of the concept of equitable food 
allocation, based on a careful examination of requirements and avail¬ 
abilities, was done first by a small Internationa] committee, then by the 
International Emergency Food Council and is now in process of l^ing 
taken over by the Food and Agriculture Organization, one of the spe¬ 
cialized agencies of the United Nations. This international body makes 
recommendations to the supplying countries as to the proper distribu¬ 
tion of their surpluses and these recommendations are followed with 
little variation. 

The problem is a most complex one. The basic unit for comparing 
food levels is the calorie, but unfortunately the simple definition in 
Webster that a calorie is the amoxmt of heat required to raise the tem¬ 
perature of one kilogram of water one degree Centigrade has not been 
so exact and indisputable when applied in the field of nutrition. Two 
caloric tables for valuing foods are in general use now, one by the 
U. S. Army and one by the International Emergency Food Council. 
There are at least half a dozen other tables used in various parts of the 
world. The two principal ones vary in caloric content from ten to fifteen 
per cent. Similarly, the effort to measure various types of grain in 
“wheat equivalent” opens the door to a thorough state of confusion. 
It is self-evident that when statisticians of many countries meet to¬ 
gether to discuss a ^ven problem, a primary requirement is that there 
be some common measure for setting down the facts and this has been 
a major task in the food field. Although this does eliminate one standard 
area for professional controversy, there will always remain enough 
other factors of disagreement to permit full self-expression. 

Obviously, the first information required for making international 
allocations is that pertaining to the requirement and the indigenous 
supply in each country. If these can be satisfactorily determined, the 
import requirement follows merely by subtraction. At once it is neces¬ 
sary to remark that in many countries where industrial production is 
higging the production of statistics is also below pre-war both in qual¬ 
ity and quantity. Unfortunately, statistical organizations have boon 
disorganized at the same time that the items to bo measured have 
boon subjected to wide variation. Even as basic a datum as population 
must be approached throu^ estimation. Substantial movements of 
peoples have taken place, in addition to the abnormal effects of war 
on both birth and death rates. And none of the devastated countries 
has. had the time or energy to take a post-war census, necessary to 
obtain a relatively secure benchmark. Similarly, agricultural produc¬ 
tion in most cases is measured much less accurately than before the 
war by the surviving statistical agendes in the various governments. 
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Anil even less certain, both abroad and at home, are the important 
estimates of wheat and coarse grains consumed by man and beast on 
the farm and thus not moving into the available supply. However, on 
the basis of the pre-war picture for which more reliable data are avail¬ 
able, and the records for the past two years, including ration levels, 
the amounts imported, and the apparent stocks on hand, estimated im¬ 
port requirements for the current year, quarterly and by months, are 
worked out regularly by the experts. 

On the other side of the equation, the possible export surplus, three 
of the chief exporting countries have regularly indicated their avail¬ 
abilities as best they can. Argentina, which provides about twenty 
per cent of the world’s exports of grain, has not participated in the 
effort to plan the most effective distribution of the available supply. 
This unfortunate situation has been met in part by the other exporting 
countries through adjustment of the allocation to offset supplies ob¬ 
tained from the Argentine. Russia’s exports, which have been limited 
in amount and have gone to very few recipient countries, have also not 
been subject to international allocation. 

But even the direct facts on immediate requirements and supplies 
are not raiou^ to solve the problem. Wheat, of course, is only one of 
many foods. It happens to be the cheapest form in which calorics can 
be purchased in substantial quantities. In allocating wheat, the inter¬ 
national committee must consider what other foods are available in 
the country with a wheat deficit. Obviously, it is quite proper in the 
light of the world shortage to cut down on wheat shipments to coun¬ 
tries which have a fair amount of other foods. Even then, however, one 
must have some regard for the necessities of balanced diets. Further¬ 
more, food habits and food requirements must be considered. Even in 
times of great need, a people docs not change its way of eating over 
night. New foods are not easily introduced, even to a starving people, 
and established prejudices are surprisingly tenacious. For example, 
com is not regarded as a proper food for human beings in a number of 
European countries, nor 7,re potatoes in the normal diet of Italy, while 
rice is consumed much more than wheat in Cuba. Differences arc not 
merely the result of taste. Countries with long winters have different 
requirements from those in the tropics. And even in our own experience 
as an occupying power, we have recognized that a much higher caloric 
requirement is needed for Germans than for Japanese to maintain the 
same health level. So the statistician, in figuring the amount of yrheat 
to allocate to any deficit nation, must have a dear picture of that coun¬ 
try’s historical eating habits and be familiar with its preferences and 
prejudices. 
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Finally, in considering the amount which is to be permitted to come 
from abroad, the allocations must not work in such a way as to punish 
the country which brings its maximum to the market-place, or all enter¬ 
prise and initiative in the direction of improved collections from the 
farms will be destroyed. Conversely, there must be some penalty for 
failure to use the indigenous supplies most efficiently. 

Since the beginning of the allocation procedure, it has always been 
true that the screened requirements for foodstuffs for all the deficit 
countries have totaled to substantially more than the availabilities, 
and here the really painful job begins—^the effort to determine where 
the requirements can be cut with the minimum of hardship. The figures 
of each country are reviewed again and again, and there are many 
conferences to explore vaiious aspects of the situation more thoroughly. 
Finally, the allocations are announced. At least, the process has made 
everyone aware of the limitations on supply and the urgency of the 
demands from other countries. 

These random comments about the allocation machinery may make 
the task appear exceedingly complex. But the fact lemains that the 
job must be done. These formidable calculations, aimed to take into 
consideration both the overall requirements and supply situation, and 
the peculiar circumstances in each case, are the only hope of providing 
some basis of fairness and equity in the distribution of scarce things to 
people who are in desperate need of them. 

There is no question but that living for millions of individuals for 
the next few years at least, will have to continue under rationing and 
allocations of critical, scarce commodities. The people in these coun¬ 
tries know that death from starvation is just as permanent as death 
from bombing. They know that allocations and rationing aie protec¬ 
tions to their lives—^that the rationing of milk, for instance, may cut 
down the number of fancy dishes served in fine hotels, but it does get 
the needed food to more mothers and infants for whom it is an essen¬ 
tial. The international allocations in the same way are an effort more 
nearly to equalize the burden of the shortage on the people of the vari¬ 
ous countries, not leaving the distribution solely to ability to offer the 
liighest bid or to the appeal of political sympathy, obligation or reward. 

The decision having been made to disregard economic bargaining or 
political discrimination and to place allocations on an objective basis, 
the key to this whole process becomes the little-heralded statisticians 
—^both those who must present the case for their countries, and those 
who must screen the competing claims and bring them into a reason¬ 
able relationship with each other. The day and night work and worry is 
theirs, but they have built in large part upon the work of other statis- 
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tiriana whose work had gone before. I hedtate to think how impossible 
it would be to handle this problem had it not been for the continued 
collection and analysis of agricultural statistics and nutritional data in 
many countries for many years. 

As a second illustration may I speak briefly about the European 
Recovery Plan? The last six months have seen a most difficult and 
complex statistical undertaking in Washington—^the examination of 
the requirements for European recovery and the study of the capacity 
of the Am^can economy and other economies to carry the European 
deficit in the meantime. This task has absorbed the full energies and 
capacities of many experts in many government agencies. The only 
rdief for the central group directing the project was temporary when 
in a lifter moment they decided to call themselves the Technical Wiz¬ 
ards on the European Recovery Program, or the TWERPS, for diort. 

Anyone familiar with Washington during wartime can easily visualize 
the time and energy required to develop the details of a plan involving 
sixteen countries and a four-and-a-quarter year period. I remember in 
the late twenties being told by a Russian economist about the tre¬ 
mendous efforts required and the manpower devoted to drawing up the 
five-year plan. Last summer, I saw French economists and statisticians 
in a state of near exhaustion from working on the so-called Monnet 
Plan. No one should regard an undertaking of this kind lightly. There 
have been no days, and even at times no nights, of rest. 

This project stems back, of course, to the su^estion by Secretary 
MarshaU that the countries of Europe get together, examine what their 
requirements will be over a period of time sufficient to permit thorn to 
put their economies on a self-supporting basis, determine how much of 
th^ requirements they can meet separately and collcctivoly, and thus 
indicate what additional help is needed to accomplish the program. 

Sixteen of the Western European nations mot in Paris last summer 
and, in an incredibly short period of weeks, drew up a program on which 
they could all agree. Undoubtedly, this agreement was possible because 
the essential elements of European recovery are beyond dispute—^that 
production must be substantially increased, soimd currencies must be 
established, and the restrictions on trade must be reduced. Their na¬ 
tional requirements were presented and assembled. After some slight 
screening and reduction where the requirements were clearly beyond 
the possibility of supply, the so-called deficit was calculated. This was 
all incorporated in the Import of the Committee of European Econom¬ 
ic Cooperation which, together with a number of technical annexes, 
was sent to Watiimgton in September. 
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Work was already well imder way in Washington by that time, par¬ 
ticularly with reference to the capacity of our own economy to meet 
such fordgn demands and the effect of such an operation upon our own 
economic operations. But the review of the European plan has proved 
to be a most complicated undertaking. Covering a period of four-and- 
one-quarter years, the program for each of sixteen countries and West¬ 
ern Germany had to be consistent as between its internal program and 
its export and import programs. For the total of all coimtries, the re¬ 
quirements from abroad and the availability of supply had to balance. 
Similarly, for the various individual commodities, demand and supply 
had to balance. For each country, its balance of payments had to be in 
equilibrium. And when supplies could not be obtained in the United 
States but could be foimd in other supplying countries, these prospec¬ 
tive sources had to be determined and incorporated into the pattern. 
In other words, the total pattern had to balance not merely as to an 
overall figure, but by conunodities, by country physical requirements, 
and by country balances of payments. Similar patterns had to bo pre¬ 
pared for each year within the period. And j^ally, the at^mpt to 
achieve both commodity and the balance-of-pasrments estimates in turn 
had to be broken down by currency areas in order to indicate the na¬ 
ture of the deficit with the dollar area. 

The number of arithmetical calculations which have gone into these 
estimates probably total more than a million. Five time periods are 
covered—^the last quarter of the present fiscal year, April 1 to Juno 30, 
1948, and the four successive years until June 30, 1962. Twenty-three 
areas were involved beginning with the rixtecn countries which par¬ 
ticipated at Paris, the dependent areas of the United Kingdom, Bd- 
^um, France, Netherlands, Portugal, and Western Germany in three 
parts—^the bizonal area, the French Zone excluding the Saar, and the 
Saar itself (since the Saar territory may shortly be incorporated eco¬ 
nomically into France). Twenty-six conunodity groups were selected 
for particularly intensive treatment, of which a nmnber such os iron 
and steel have to be built up from a series of separate categories—pig 
iron, scrap steel, iron ore, crude and semi-finished steel, tin plate, steel 
sheet and other finished steel—and for projections of the volume and 
value of trade covering movements of commodities among the par¬ 
ticipating countries, between the participating countries and the United 
States and other Western Hemisphere and other non-partidpating 
countries in turn. After the volume of this trade was derived, it had 
to be multiplied by prices to obtain values. While July 1, 1947 prices 
were used, it was necessary to ascertain prevailing prices in different 
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areas of the world on that date since the prewar assumption that 
prices for internationally traded commodities tend to be equal the world 
over has lost its validity under the conditions which prevail today. 

The task of combining the figures provided by the commodity 
committees into a coherent system from which balance-of-pa 3 rments 
estimates could be derived fell, as it happened, to the Department of 
State. An early courageous attempt was made to grapple with it by 
assembling the adding and calculation machines of which the Depart¬ 
ment can boast only a sparse and scattered population, and by amass¬ 
ing at the same time the clerical as^tance necessary to man or 
“woman” the machines. The attempt was futile. The traditions of the 
Department of State and its personnel training are oriented more to¬ 
ward the accurate and careful phrasing of a memorandum than the 
well-multiplied, checked and proven statistical table. Resort was 
necessarily had to punch cards, and automatic sorting and addition. 
It may be that the Foreign Office of 10 years hence will boast a full 
line of international business machines with operators in 24-hour at¬ 
tendance. As of today, it was necessary to work the calculations in on 
the graveyard shift at Cmsus, Bureau of Labor Statistics, and finally 
in the Department of National Defense. 

One further difficulty was to establish the price assumptions to be 
used for the future period. Here the crystal ball was particularly cloudy. 
The Paris Conference had used the prices then current, July 1, 1947, 
as the basis for both exports and imports for the filrst year (1948) and 
had then assxuned that European export prices would remain firm, 
while import prices would decline by 7| per cent in 1949,10 per cent in 
1950, and 12| per cent in 1951. The American reviewers have felt 
that the only way out of this dilemma was to present a range. Actu¬ 
ally, the bade calculations were made in July 1, 1947 prices, but the 
totds have been adjusted globally to meet different sets of price as¬ 
sumptions. 

For the first fifteen months, all exports from Europe and imports 
to Ehirope from the Western Hemisphere except XJ. S., are calculated 
at 5 per cent above July 1, 1947 prices, while U. S. and non-Westerm 
Hemisphere shipments to Europe are 7.5 per cent above July 1,1947. 
In the later years, the calculations on the high price assumption hold 
the price level constant with the level for the initial period, while the 
calculations for the low price assumption are based on a marked de¬ 
cline, particularly for the items imported to Europe. These different 
assumptions explain the presence of a range in the total requirement for 
the program of $15.1H(17.8 billions. 
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The projections are, of course, not blueprints which can be followed 
during operation. Actually, this is a sketch rather than a blueprint. 
Its purpose is to provide Congress and the public with as accurate an 
estimate as possible of what the program may in fact turn out to be, 
and the general magnitude of the requirements from abroad, if the 
European recovery program is to be accomplished. If some of the 
items are not available in the quantities indicated, there may be sub¬ 
stitution. If availabilities increase, prices will faU, or if they decrease, 
prices will rise. In either event, the dollars involved will tend to be 
more nearly constant than the constituent elements. And with so 
many commodities and countries, we can fall back on the protection of 
all statisticians, the hope that the deviations wiU be somewhat com¬ 
pensatory. 

In the original undertaking, five sets of questionnaires were drawn 
up by the European group to obtain information on food, fuels, ma¬ 
chinery, iron and steel, transport, and balance of payments. This 
information has been available to us in Washington, and to it has been 
added further information which we requested, plus the vast reservdur 
of knowledge accumulated in our government. However, it is unfor¬ 
tunately clear that there are some serious gaps in the ba^c information 
required. 

Even with the most complete information possible, there could be 
no assured results. The most that we c^n do is to achieve consistent 
and lopcal results from as reasonable assumptions as can be made. As 
I have already pointed out, assumptions had to be made as to price 
levels. Another uncertainty is created by the necessity to estimate 
crops. Should we assume that the weather will continue to be as un¬ 
cooperative with the fanner in Europe as it has been since the end of 
the War? And there are many other unknowns, such as at what point 
the processes of commodity hoarding will cease and money will be used 
again as a store of value. 

It is clear that the Recovery Program will have to be a dynamic 
and flexible operation. As was true during the War, programs wiU have 
to bo changed from time to time as conditions change, both as to coun¬ 
tries and as to commodities. To acMeve the most effective use of the 
available resources, there will need to be continued and detailed sta¬ 
tistical recording of progress made, and forecasts of the short and long- 
nm prospects. The injection of statistical methods into fordgn policy 
is therefore no temporary expedient—but promises to be a continiung 
necessity. 

Certain conclusions are now apparent concerning our capadty to 
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reduce the problems which I have been discussing to exact measure¬ 
ment. The first is the common complaint of the statistician—^that we 
do not have adequate data. This is, of course particularly true of the 
countries where the effects of the War are still fdt so severely. On the 
American side, we know much too little about the statistical quality 
and relevance of much of the foreign data. It is clear that these inter¬ 
national projects require cooperation and understanding between the 
statisticians of all participating nations. And time can be used up 
most rapidly if one is skipping about among long tons, short tons, and 
metric tons, not to mention bushels, quintals, hundredweights, barrels 
and Imperial gallons. 

On this point, that of the development of statistical data and the 
effort to achieve greater uniformity, there is much that the United 
Nations can do, supplemented by the private international statistical 
organizations. The establishment of a Statistical Commission by the 
United Nations and the international statistical meetings held in 
Washington last September may give us some encouragement. This is a 
long-time job—^it calls for continuous support and stimulation. If 
much is to be accomplished, the statisticians in the United States must 
take the lead. We must continually be prepared to demonstrate that, in 
this modem world, many problems can be faced properly and solved 
economically only when measurement is respected as a fundamental 
characteristic of the anal 3 ^. 

But beyond these points, there is a continuing frustration because 
too few relationships have been reduced to calculable form. It is ob¬ 
vious that planning really requires both cost and market data, that the 
requirements in the form of materials must be readily related to ca¬ 
pacity, that labor supply and working capital requirements must all be 
part of such contideration. Here the statistidan can make endless 
contributions, substituting detailed analysis for the rule of thumb or 
the experimental approach. In these matters, too much is at stake to 
be careless or casual. Nevertheless, our answers often contain far too 
much of the “rough estimate” and too little of the careful calculation. 

The same problems arise in connection with the use of statistics in 
the foreign policy field as in any other—^the struggle to do an honest 
and objective job, and the difiSculties of convincing others that such 
is the result. There are always those who would prefer to be guided by 
motion, prejudice, and preference. It is not at all surpridng that 
suspidons which continiudly cross national boundaries are not stopped 
by statistics. 

Nevertheless, the fact remains tiiat problems must be handled; 
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there must be answers; and statisticians, both here and abroad, can 
contribute much to finding the best answers. As a piofessional group, 
we have a clear-cut responsibility. First, we must continually strive to 
improve our performance through developing our own capacities and 
improving the raw material with which we work. But beyond that, we 
must persistently try to persuade others to take an objective view of 
the facts, to use to the full the techniques and capacities which we have 
to offer. We cannot promise to solve all the world^s problems. Many of 
them are not problems of measurement at all. But where possible the 
utilization of statistics, the reference to objective criteria, and the 
effort to measure before committing oneself to a line of action—^these 
are all ways in which rational men approach problems. And foreign 
policy should be no exception to the rule. 

Unfortunately, foreign policy is an area where it is all too easy for 
emotions and prejudices to be aroused, where problems all too often 
get subjective treatment. One approach towards international under¬ 
standing is to expose problems to the facts whenever possible—and 
they must be accurate and dependable facts. The battle between 
prejudice and analysis is our battle, in which we must supply much of 
the ammunition. Our responsibilities and opportunities have greatly 
increased in the international field. We not only can help in the de¬ 
velopment of knowledge about the world we live in, but we can actu¬ 
ally contribute substantially to that international understanding which 
is so greatly needed in the world today. 



HISTORY OF THE USES OF MODERN 
SAMPLING PROCEDURES* 


Fbederick F. Stephan t 
Princeton University 

I. INTEODXJCTION 

M odern sampling practice is the result of combining statistical 
theory with scientific knowledge about the material that is being 
sampled and with experience that has been gained through the use of 
various sampling techniques in surveys and experimental work. Its 
history stems from many roots and its applications branch out into 
many fields of science, commerce, manufacturing, agriculture, educa¬ 
tion, and government administration. In spite of its wide range of 
usefulness, sampling practice has been neglected in the training of 
statisticians, in the textbooks and treatises, and in the planning and 
analysis of most experiments and studies [1]. However, like Cinderella, 
it has risen from neglect to a position of well-deserved importance. 

The history of sampling practice provides a useful background for 
the discussion of present theory and applications. Dijficulties beset any¬ 
one who attempts to trace the developing uses of sampling because they 
are scattered throughout many branches of science and technology and 
are described, if at all, in subordinate portions of reports and articles 
whose titles provide no hint of what they may contain on the subject of 
sampling. Hence, this paper not only will touch each development veiy 
briefly and inadequately but no doubt it will miss entirely a number of 
highly important facts. (The author will appreciate suggestions for 
correction and addition.) The principal emphasis will be on the steps 
by which the present stage of progi'ess in applied sampling was reached 
and no attempt will be made to present a comprehensive picture of 
present-day uses or to appraise the relative value of each contribution 
to the accumulation of technical knowledge. 

There are two dangers that both author and reader must avoid in an 
historical review: (a) they may read into the records more than was 
actually there and (b) they may assume that nothing existed that is 
not given in the records. I hold to the view that the principles and 

* Presented at the 25th Session of the International Statistical Institute, a constituent part of the 
International Statistical Conferences. Washington. D. C.. September 6-18, 1947. This paper wiU also 
appear in the Proceedings of the Conference, which will be published in the near future. 

t This paper was prepared in connection with studie of sampling under the Committee on the 
Measurement of Opinion, Attitudes and Consumer Wants of the National Research Council and Socjivl 
Science Research Counoil. 
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technical practices of sampling emerged rather gradually from notions 
that wore simpler, sometimes inaccurate and confused, and often not 
recognized by those who held them as being of any special impoitance. 
Hence the records are fragmentary and there were doubtless many in¬ 
stances in which principles were developed by two or more workers in 
ignorance of the other’s work. I would not say that the same discovery 
was made “independently” for there is always a flow of common ideas, 
sometimes rapid and sometimes slow, in which all share. Hence it 
should be understood that the particular examples that will be men¬ 
tioned are part of a larger development of practice and theory which 
connected them in various known and imknown ways and often in¬ 
fluenced piofotmdly their characteristics and effectiveness. 

Finally, greater attention will be given to the use of sampling in 
large-scale surveys rather than to its use in the laboratory or in small- 
scale (local) experiments. 

n. EARLY EXAMPLES OF THE USB OF SAMPLES 

The eailiest examples of sampling procedures are to be found in cer¬ 
tain very ordinary human activities. The common practice of taking 
a small part or portion for tasting or testing to determine the charac¬ 
teristics of the whole precedes recorded history and is one of the roots 
from which sampling methodology stems. Stirring and mixing before 
taking the sample is a prototype of randomization. The efforts of 
scientists and their predecessors to draw conclusions about the laws 
of nature from what they could observe in their immediate environ¬ 
ment was a sampling process. Astronomy, perhaps the oldest of the 
sciences, started with the moon, the larger planets, and the stars that 
one could see from positions near the equator or north of it. Even to¬ 
day it is limited to those heavenly bodies that can be photographed 
with the best available instruments. 

All empirical knowledge is, in a fundamental sense, dciived from in¬ 
complete or imperfect observation and is, therefore, a sampling of ex¬ 
perience. An unusually interesting example, not unrelated to modem 
population research, can be found in Halley’s selection of mortality 
statistics in Breslau to form the basis of his life table. From this sample 
of one city he drew general conclusions pertaining to “the moitality of 
mankind” [2]. This was in 1693. Two centuries later, Sir John Lawes 
likewise used the annual record of wheat yields at Rothamsted on 5 
plots totalling 33 acres to estimate the change in yield x>er acre from 
1852 to 1879 for all of England and Wales [3]. While these examples 
were both statistical, all scientific observation, whether statistical or 
not, is based on sampling. 
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In spite of the mdespread practice of sampling in commerce and in¬ 
dustry and its utilization in scientific research, official statisticians have 
usually set as their goal the complete counting of a population and have 
sought to avoid any use of sampling or any inference beyond the bare 
description of that population at the time it was counted. Still they 
have been compelled to compromise, in the face of practical difficulties, 
on something less than complete coverage and often also on something 
less than adequate accuracy and quality in the information recorded. 
Such compromises have been common in the collection of many kinds 
of statistics, particularly those usually collected by voluntary reporting 
such as employment, wage rates, prices, and crop yields. It is also 
foimd in the collection of data that ought to be complete in order to 
satisfy certain non-statistical purposes, as for example in the re^tra- 
tion of births. Often the official statistician, faced with great difficulties, 
oppotition, and indifference on the part of the public and with only 
weak powers to support his efforts, could only compromise or resign 
his post. The result of the compromise was often a very crude and im- 
satisfactory form of sampling which strove to get as much information 
“as possible” and accepted what it got, instead of attempting to con¬ 
trol the selection by some of the means that were available. 

Interestingly enough, sampling has generally preceded the establish¬ 
ment of regular censuses. Prior to the first British Census in 1801 the 
fflze of the population of England was not known very accurately and 
various speculations were made and estimates prepared and disputed. 
Malthus projected his famous theory of population, and Adam Smith 
his Wealth of Nations, before the establishment of periodic national 
censuses. At least as early as 1754, estimates of population of England 
were made from the number of houses on the tax list plus a rude esti¬ 
mate of cottages not taxed, the total of dwelUngs being multiplied by a 
somewhat arbitrary factor of 6 persons per dwelling [4]. Other esti¬ 
mates were based on the reported number of baptisms, marriages and 
burials. 

In 1800 Sir Frederick Morton Eden estimated the population of 
Great Britain at 9,000,000 using sample data on the average number of 
inhabitants per house as well as the number of births. The first census 
of Great Britain in 1801 confirmed his estimate. Somewhat similar 
methods of estimation have been used between censuses in the U.S.A. 
as recently as the last decade [5]. 

About 1765 Messance, and 1778 Moheau, published very carefully 
prepared estimates for France based on enumeration of population in 
certain districts and on the count of births, deaths and marriages as 
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reported for the whole country. The districts from which the ratio of 
inhabitants to births was determined truly constituted a sample. 
Laplace prepared similar estimates in 1802 from enumeration of 30 
departments of France and the reports of births, deaths, and marriages 
from 1799 to 1802. This followed a plan he published in 1786. lie made 
a remarkable step forward in attempting to measiu-e the precision of 
his estimate and aimoimced that the odds were 1161 to 1 that it was 
not in error by more than 500,000 inhabitants or 12 per cent of the total 
population. Even though the method of estimate was crude and the 
measure of precision not wholly valid, Laplace’s effort was much more 
successful than the complete census of France that was attempted at 
the same time. A similar procedure had been employed in 1784-91 by 
the famous chemist Lavoisier to estimate the number of horses, cattle, 
sheep, and pigs, as well as the area under cultivation [6]. 

The foregoing examples suggest that modem sampling procedure 
might have developed at least a century sooner than it did if it had 
received more attention from the scientists of the day. Only the 
officials of statistical bureaus and other posts continued to concern 
themselves with the problems of measuring population, and they were 
preoccupied with difficulties in the classification and interpretation of 
data, in the administration of statistical bureaus, and in the important 
problems of trade, finance, industry, agriculture, public health, etc. 
for which statistical data were needed. Hence they favored complete 
censuses or the closest approach to them that was feasible. 

m. EXAMPLES 07 DEVELOPMENT 07 NEED 70B 
E77ICIENT SAMPLINO METHODS 

It would be interesting to explore the development of statistical 
work in all its many ramifications and inquire about the places in 
which modem sampling methods could have been used to good ad¬ 
vantage. That is beyond the possibilities of a paper such as this. In¬ 
stead examples will be given from the beginnings of four general lines 
of statistical work in which efficient sampling methods were needed: 
(a) agriculture crop estimates, (b) economic statistics of prices, wages, 
employment, etc. (c) statistical phases of social surveys and health 
studies, and (d) public opinion polling. The more recent developments 
of the use of sampling in these and other fields will be traced in suc¬ 
ceeding sections. 

(a) Agricultural crop and livestock estimates: At the present time a 
tremendous variety of statistics are collected in the United States on 
acreage planted in each principal or special crop, the estimated yield as 
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judged at successive dates during the growing season, actual yields, 
numbers of livestock, equipment, farm labor, marketing, and other 
aspects of agricultural production. Other countries collect similar 
statistics with vaiiations in scope, detail and frequency. In the U. S. as 
in many countries, the methods used for this purpose iuclude periodic 
censuses, special surveys, voluntary reporting by selected respondents, 
and records produced in connection with taxation, marketing and 
foreign trade. 

This vast system had its beg inni ngs about a century ago. The first 
U. S. Census of Agriculture was taken in 1840. At the same lime col¬ 
lection of agricultural statistics was begun by the Patent Office. 
Monthly reports were collected and published by the editor of the 
ATTimcan Agriculturist in 1862 and in 1866 regular reports on acreage, 
condition of crops, yield, and livestock were begun in the newly estab¬ 
lished Department of Agriculture. Annual reports of prices wei'c 
added in 1867, Thereafter the system developed into an extensive 
organization of agents and voluntary reporters that increased from 
4000 in the late 70’s to 300,000 in 1931 and a somewhat larger numl)cr 
now [7]. (The present system is described in a paper presented by Dr. 
C. F. Sarle at the International Statistical Conferences.) The methods 
of sampling were quite crude, many of the reports were simply based 
on the respondent’s judgment about conditions in his locality and the 
list of respondents was built up of fanners, agricultural agents, members 
of the staff of agricultural colleges, etc. who were willing to serve with¬ 
out pay and were located at places widely scattered over the country. 
The system was a compromise between the excessive cost and slow re¬ 
turns of complete enumeration on the one hand and the wholly un¬ 
satisfactory alternative of relying on reports issued by private specula¬ 
tors or going without current information, on the other. Various at¬ 
tempts were made tO improve the reliability of the statistics that re¬ 
sulted from these reports but they continued to be biased and of 
limited accuracy. Nevertheless they have been of great value to 
farmers and to other users of the statistics and they are the principal 
source of current data on agriculture. Hero as elsewhere the introduc¬ 
tion of modem sampling methods came late but is now making great 
contributions to the improvement of the statistical information in this 
field. 

(b) Economic staUstics: The history of economic statistics is a long 
and detailed story which can only be sketched here in inadequate out¬ 
line [8]. Again, as for population and agriculture, there were early at¬ 
tempts to make the best of scanty and fragmentary material. Censuses 
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and special investigations have provided increasingly valuable data but 
at long intervals and in inadequate detail. 

In many countries, economic statistics derived primarily from the 
records produced by taxation and customs duties. This was true in 
the United States. Statistics about subjects that were not provided by 
the system of taxation had to be obtained by other means. More than 
a century ago the collection of reports of prices, wage rates, hours of 
work, employment, and production was begvm with voluntary report¬ 
ing by employers and occasional special surveys as the principal 
methods. Massachusetts established a statistical bureau in 1869, 
Pennsylvania in 1870, and other states at later dates; the Federal 
Government set up a Labor Bureau in 1884. These bureaus developed 
staffs of field investigators to collect data at monthly or other peiiods 
but reUed heavily on mail questionnaires for many of their principal 
scries. This continues to be the prevalent procedure although the 
establishment of social insurance systems and employment exchanges 
has provided other som'ces of statistics on earnings and employment. 

Similar systems of collecting reports are operated by other agencies 
concerned with production, trade, and finance. These inquiries have 
been based on simple sampling procedures such as the use of a scat¬ 
tered group of voluntary reporters, except for certain relatively recent 
surveys. A serious effort was made, for example, to get wholesale 
prices "in representative markets” and to get labor data by sending 
agents into various districts with a list of employers from which they 
could choose what they believed to be a representative group, but the 
techniques of selecting a representative sample had not been developed 
as formal procedures. Here again modem sampling procedures have 
been introduced late but are making noteworthy contributions, in con¬ 
junction with other improvements, to the accuracy and value of these 
statistics. 

(c) Socud surveys and health surveys: The distinction between social 
surveys and certain related kinds of inquiries is difficult to make. Some, 
like LePlay's nomographic studies, are very intensive observations of 
a few individuals or families; others include thousands of cases but less 
intensively and with more attention to statistical analysis. Williams 
and Zimmerman listed 1500 studies of family living in the U. S. and 
elsewhere, made prior to 1935 [9]. Booth’s Survey of London life and 
Labour, Rowntree’s survey of poverty in York, the reports of the 
Immigration Commission in 1907, and the Pittsburgh Survey of the 
Russell Sage Foundation are examples. While complete and thorough 
investigation of the problems was the ideal to which such surveys 
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aspired, they were necessarily limited to a selected city, and to some¬ 
what fragmentary data on pertinent questions by the sheer impossibil¬ 
ity of a comprehensive canvass. Other surveys were made under the 
inspiration of their example but progress in methodology came slowly 
and as in many other investigations the accuracy of the results was 
seldom tested. 

Sampling has been used extensively in studies of poverty and un¬ 
employment. During the depression of 1873-79 Carroll D. Wright used 
police in 19 cities and wrote to assessors in 375 towns throughout 
Massachusetts inquiring about the number of unemployed. His esti¬ 
mate of 28,508 corrected the current assertions that 200,000 or 300,000 
workers were out of work [10]. Similarly in 1873 the number of unem¬ 
ployed in New York City was estimated on the basis of reports from 400 
volunteer visitors of the Association for Improving the Condition ot 
the Poor. These procedures were, of course, crude and inaccurate but 
they reflected a praiseworthy desire to measure what was being dis¬ 
cussed in still less accurate terms by important public leaders. In 
1893, unemplo 3 raient was estimated indirectly by the decline in fac¬ 
tory employment from its highest figure in a series derived from a 
sample of reporting employers constituting 70 per cent of the produc¬ 
tion in the state [11]. In 1893, estimates were collected by mail from 
respondents in 119 cities by Bradstreet. In New York City, Chicago 
and other cities, police and health inspectors surveyed samples of 
homes or factories [12]. In the depression of 1914-15, many surveys of 
unemployment were made, notably by canvass of the workers insured 
by the Metropolitan Life Insurance Co. through its agents in 29 cities 
[13] and by canvass of 104 city blocks and 3703 additional tenement 
houses by the Bureau of Labor Statistics using about 100 tenement 
house inspectors. Both agencies repeated their surveys in New York 
Qty eig^t months later [14]. 

In the depression of 1921-22, somewhat similar methods were used to 
estimate unemployment and, in general, while the results w'cre far from 
satisfactory, there was improvement in the identification of the un¬ 
employed as distinct from other needy groups and increased detail in 
the information collected. Especially noteworthy as an advance beyond 
previous practice was the survey conducted in Columbus, Ohio, by 
Frederick E. Croxton and Mary Louise Mark using 100 university 
students to canvass three districts within the city selected by the presi¬ 
dents of the Federation of Labor and the Chamber of Commerce as a 
“fair sample of the wage-earning population of the city” and including 
approximately 10 per cent of the wage-earning population. The survey 
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was repeated annually from 1922 to 1925 [15]. A mmil ar procedure was 
used in Philadelphia by J. Frederic Dewhurst and Ernest A. Tupper 
who had school attendance officers canvass 166 small districts selected 
to provide an accurate cross-scction of unemployment [16]. This survey 
was repeated annually during the 30’s and provided several kinds of 
data not previously available. The Columbus procedure was used in 
1929, 1930, and 1931 in Buffalo, N. Y. by Croxton [17] and in 1931 in 
Syracuse, N. Y. by John N. Webb [18]. 

Among numerous health surveys made in the U. S. from time to 
time, several examples may be mentioned that resemble the preceding 
surveys in their methods of sampling. In 1921, the U. S. Public Health 
Survey selected Hagerstown, Md., as a conveniently located and 
“fairly typical small city in the East not greatly influenced by im¬ 
migration” in which to study sickness and fertility by periodic visits 
to a sample of households [19]. In 1929-30 the Committee on Costs of 
Medical Care made many sample studies, including one on the in¬ 
cidence of illness among 9000 families [20]. The U. S. Public Health 
Service and Milbank Memorial Fund studied sickness among those 
elements of the population that had borne the brunt of the depresaon 
by selecting severely effected districts in 10 localities, omitting the 
slums, Negro, and well-to-do sections [21]. 

(d) Public opinion polls: The practice of surveying public opinion 
emerged from simple but obscure beginnings in the “straw vote” 
conducted by newspapers to sense public reactions to candidates and 
obtain human interest stories by interviewing the “man in the street.” 
Even before 1900 the New Yorle Herald collected pre-election reports 
and estimates from all over the United States and attempted to fore¬ 
cast the outcome of elections. In 1904 it polled 30,000 registered voters 
in New York City. In 1905 the Chicago American and Chicago Journal 
took a poll during the mayoralty campmgn and the Columbus DispatxJi 
started a long scries of polls by taking one in the state election of 1906. 
In 1912 papers in Boston, Chicago, Cincinnati, Denver, Los Angeles 
and St. Louis conducted a nation wide poll in 37 states. Four years 
later, this group of papers repeated its presidential election poll and 
another poll was taken by the Hearst newspapers. The Eexall drug 
store chain took a nation wide poll in 1920 using its stores as centers to 
collect the ballots. These and other efforts were outdone in aze by Tha 
Literary Digest which started polling in 1916 and in 1920 mailed out 
11,000,000 ballots to persons whose names ware taken from telephone 
directories and other lists. Two years later it conducted a poll on pro¬ 
hibition. Other polls were taken at the rate of about one every year and 
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a half. Robinson found that more than 86 straw polls were taken in the 
presidential campaign of 1928. Although these polls employed quite 
crude methods, they were successful in many instances because a 
moderate error only affected the prediction of the winning candidate 
when the election was close and because among many attempts some 
might be expected to turn out well. The sources of error in straw polls 
were analyzed in detail by Robinson in 1932 [22]. 

Closdy related to the polls were market research studies and con¬ 
sumer surveys conducted by business concerns, publications, and ad¬ 
verting agendes. The results and methods of these studies received 
less publicity than the polls but they offered even greater opportunities 
for sampling than the straw votes. There was also a growing interest 
in public opinion research among political scientists, sociologists, and 
others which led to scientific interest in the improvement of the tech¬ 
nique of opinion polling [23]. 

The foregoing discussion was limited to uses in which the sampling 
methods were quite simple, indeed often crude and inefficient, and not 
based on explicit condderations of probability. The dividing line is not 
sharp between them and some of the other surveys that will be included 
in later sections but they represent fairly well the pioneer stage in the 
emergence of modem sampling practice. This stage might also bo 
illustrated by examples of experiments, surveys, and studies in other 
fields in which sampling in some form was used. 

While, in these early instances, the sampling procedures were simple 
and usually employed uncritieally with no great attention to accuracy 
and representativeness, it should be noted that the problems of ob¬ 
serving and recording data were almost always far more sciious than 
the problems of sampling. Modem sampling procedures, had they been 
available and had they been applied effectively, would have permitted 
the use of smaller samples and made possible in most coses more care¬ 
ful fieldwork. Through these developments and those in other fields of 
statistioal study, there arose a great opportunity for the use of better 
sampling procedures that has been exploited only in part up to tho 
present time. 

IV. DEVELOPMENT OF THE USE OF THE SIMFLEB BAKDOM AND 
8TSTEMATIC SAMPLING TECHNIQUES 

Much of modom sampling practice rests on processes of selecting 
individuals at random or according to certain systematic procedures. 
These processes may be employed as providing a simple method for 
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selecting a sample or they may be used as parts of a more complex 
sampling scheme. 

Random sampling utilizes the devices of games of chance or other 
methods that assure to each unit in the source from which the sample 
is drawn an equal probability before the draAv that it will be included 
in the sample. It has the advantage of facilitating certain applications 
of the theory of probability to the analysis and use of the results. 

Although the theory of probability was well established in the 
eighteenth century, its applications to the practical drawing of 
samples were delayed imtil the tiventieth. Applications to games of 
chance and lotteries can be excluded from consideration since they were 
not concerned with estimating the characteristics of a population from 
a sample. Likewise, applications to the interpretation of data as having 
been produced by a random sampling process from a hypothetical 
universe can be excluded since they did not involve the actual practice 
of drawing samples. Also trials with dice or cards designed to “verify” 
known probability laws or merely for expository purposes are not 
examples of sampling practice. Doubtless there are many instances that 
have escaped the author’s notice, but the earliest instances he has 
found are Bowley’s sampling of a list of bonds and their interest rates 
in 1906 using the final digits in one of the tables in the NatUicai Alma¬ 
nac to make the random selection and Student’s testing of the f-dis- 
tribution in 1907 by drawing cards from a large receptacle. Random 
sampling was facilitated by Tippett’s tables of random numbers 
published in 1927 [24]. The practical importance of random selection 
slowly gained recognition, especially in connection with more complex 
procedures that were developed after 1920 and will be discussed in the 
next section. 

Systematic sampling employs a simple rule of counting cases in some 
convenient order and selecting for the sample every nth case, or of 
using some similar pattern of selection such as taking material at 
measured inteiwals or drilling samples out of a large mass of metal by 
use of a template that locates the holes. More complex rules or patterns 
may be used and the starting point may be selected Ijy a random choice 
or special rule. The theory of probability can also be applied to the 
results of systematic sampling under certain conditions. Bowley and 
others have interpreted it as a case of stratified random sampling and 
estimated sampling errors accordingly. 

Systematic sampling has long been in use in taking samples of ore 
from the face of a vein in a mine or samples of metal from a pig or ingot. 
The practice of “cruising” a forest and sampling trees at uniform 
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intervals has been well established in forestry for many years [25]. 

Systematic selection was utilized by A. N. Eiaer in his survey of 
Norwc^n workers in 1895. As a means of facilitating special tabula¬ 
tions from census schedules, it was used in a study of family data in the 
1900 Norwe^an census, a study of marriage in the Danish census of 
1901, and a housing study for Oslo in 1913-14 [26]. 

Kiaer’s work apparently had little influence on surveys in the U. S. 
or in Europe. Apart from the discussions he initiated in the BvUetin 
of the International Statistical Institute from 1896 to 1903, it was 
mentioned only briefly by Edgeworth in his Presidential address before 
the Royal Statistical Society in 1912 and in the U. S. by W. B. Bailey 
and F. S. Chapin in 1906 and 1920 respectively [27]. It had a greater 
effect in certain continental surveys [28]. 

In 1909 Bowley included a brief chapter on Sampling in his Ele¬ 
mentary Manual of Statistics making general references to sampling in 
commerce, mining, and industry, but not to specific sample surveys. 
However, in 1912 he took every twentieth working class household in 
Reading in a study of poverty and his associates did the same for three 
other cities in 1913 [29]. He computed probable errors of sampling and 
recognized other sources of error in the interviewing, definitions, and 
process of estimating. This study was followed by a systematic sampling 
of census schedules in 1915, and by similar surve 3 rs of Liverpool (1930) 
and Merseyside (1931) by Caradog Jones, Southhampton (1927f) by 
Ford, and London (1929) by the London School of Economics under 
Bowley’s direction [30]. 

John Hilton made a series of studies of workers in the xmcmplo 3 rment 
insurance system beginning in 1923 [31]. They were selected systemati¬ 
cally from the files of the Labour Exchai^es. In addition to the in¬ 
formation in the records some data were obt^ed by interviewing 
workers who came to the exchange. Certain deviations from strictly 
systematic selection introduced biases which were reduced by sub¬ 
sequent improvements of the method. Hilton found a sample of only 
one per cent quite satisfactory to meet the practical administrative 
and policy-making purposes for which the studies were made. The re¬ 
duction in expense that resulted from sampling such a small proportion 
of the recor<k was indeed imprestive. Oddly enough the method was 
not imitated to any great ^dent by other government bureaus. 

Another example of the great usefulness of sampling to get quick 
inexpensive tabulations was furnished by the Japanese who took a 
sample of one in 1000 from thdr 1920 census to get data quickly after 
the earthquake of 1923 [32]. Subsequent checking when the usual 
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tabulations were completed verified the accuracy of the sample results. 
Further examples of sampling census schedules for special tabulations 
are Day Monroe’s study of Chicago families in the 1920 Census and 
R. F. George’s study of British workers in the 1931 Census [33]. 

In spite of this successful experience, the use of random and sys¬ 
tematic sampling procedures in statistical work made slow progress. 
At the Rome Session of the International Statistical Institute in 1925 
a resolution was adopted recommending the use of sampling for 
statistical purposes with appropriate precautions as to its representa¬ 
tiveness, mathematical statement of the precision, and full description 
of the methods employed. The reports that were submitted by the 
commission that drafted the resolution presented evidence of the use¬ 
fulness of sound sampling procedures but both the reports and the 
resolution had less immediate effect on statistical practices than might 
have been expected. Eight years later, in 1934, Ncyman revived the 
discussion of these reports in a notable paper before the Royal Statisti¬ 
cal Society. 

In America, serious attention was given to problems of sampling 
methodology by several committees of the Social Science Research 
Coxmcil. There was a strong current of interest among the sociolo^sts 
including Chaddock, Ross, Ogbum, Stouffer, Lundberg, Dorothy 
Thomas, Stephan and others [34]. Margaret H. Hogg, who had worked 
under Bowley’s direction on some of the British surveys, came to 
America to the staff of the Russell Sage Foundation and there made a 
critical study of employment and unemployment statistics. In an 
article in the Journal of the American Statistical Association she made 
a strong plea for rigorous methods of sampling and cast doubt on the 
value of surveys such as those that had been made in Philadelphia and 
Buffalo, in which the sample was selected by judgment rather than 
random procedures. She was equally concerned about the adequacy of 
the classification of workers into the various categories of employment 
and unemplo 3 mcnt and the analysis of the data in tcims of significant 
questions. In the spring of 1931 Miss Hogg made a survey of unem¬ 
ployment in New Haven, Connecticut, partly for the purpose of testing 
the practical difficulties of applying a random sampling method and 
also developing better schedules and statistical categories for unom- 
ployment surveys [35]. 

Two very important general developments affecting the use of 
sampling in the United States occiured in 1933. One was organization 
of large-scale work projects for the unemployed under the national 
programs of the Federal Emergency Relief Administration (1933-35), 
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Civil Works Administration (1934-35), and Work Projects Administra¬ 
tion (1935-40). The second was the enlistment of many leading 
statisticians from the universities and butiness in the reorganization of 
government statistical work and administration of emergency agencies 
[36]. The statistical needs of the government increased tremendously as 
it took active steps to meet the problems of the depression and under¬ 
took various New Deal programs. Since the American statistical system 
is decentralized, it became necessary to establish a Central Statistical 
Board to coordinate and regulate the statistical activities of many 
agencies engaged in collecting data for their own purposes or for gen¬ 
eral use. The Board was largely an outgrowth of the Committee on 
Government Statistics and Information Services, an advisory group 
organized by the Social Science Eesearch Council and the American 
Statistical Association at the request of the Secretaries of Agriculture, 
Commerce, Interior and Labor to assist them in overhauling the statis¬ 
tical work of thtir Departments. 

The great variety of statistical work done by the regular agencies, 
the National Recoveiy Administration, and other emergency agencies 
far exceeds the scope of this paper. Only a few large scale surveys can 
be described briefly as examples of those of their projects that were 
based on sampling; many smaller and more specialized studies were 
also undertaken. A number of diflSculties prevented these projects from 
embodying fully the best sampling procedures then known and from 
exhibiting the range of uses to which sampling methods could be put. 
Most of the work was started with inadequate time for planning and 
preliminary trials. There were, curiously enough in a period of unem¬ 
ployment, severe shortages of persons technically competent to super¬ 
vise the work. Moreover the field workers and office staffs wore drawn 
largely from the unemployment relief lists and worked under restric¬ 
tions as to hours and rates of pay. In spite of these difficulties the re¬ 
sults contributed greatly to many very important programs and pro¬ 
vided detailed statistical information where there had been little or 
none before. 

(1) The Financial Survey of Urban Houting was taken in conjunc¬ 
tion with the Real Property Inventory, a complete canvass of bousing 
in May 1934 covering 60 dties. The Financial Survey was taken by 
interviewers who vitited all the families in one block out of every ten 
in larger cities and one in seven in smaller cities. Additional schedules 
were obtained by mail from families in another four blocks out of ten. 
The survey resulted in a 1200 page report of detailed financial data 
and was unique as an early instance of the use of sampling procedure 
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for obtaining special information as part of a single complete canvass 
[37]. 

(2) A series of research studies of workers on relief and related sub¬ 
jects were undertaken by the FERA and WPA in selected cities and 
counties from 1933 to 1940 using a variety of methods and samples. The 
localities were selected in part by judgment of their representative¬ 
ness of different types of situations that were to be compared, in part 
by administrative considerations of availability of supervisors, non¬ 
interference with other projects, etc. Systematic sampling was used ex¬ 
tensively within the localities for many of these studies and the samples 
were appraised critically in the reports. One of the studies was an 11 
per cent national sampling of workers on relief in March 1935, with 
proportions varying from state to state, providing very detfdled tabu¬ 
lations by age, occupation, and education. The accuracy of the sample 
results was verified by comparison with subsequent complete tabula¬ 
tions and a table of sampling errors was given [38]. 

(3) The Study of Consumer Purchases was conducted in 1935-36 by 
the Bureau of Labor Statistics in 32 cities and the Bureau of Home 
Economics in 19 small cities and 206 villages or rural counties [39]. 
The study was a work relief project and other agencies participated in 
the planning and analysis. In selecting the localities, available data on 
their economic characteristics were used to achieve a high degree of 
representativeness; within each locality families were selected sj^ 
tematically from city directories, the schedules of the Real Property 
Inventory (New York City), the 1934 WPA Census of Chicago, and 
similar lists. The smaller areas were canvassed completely. From the 
first simple schedules obtained for 783,000 families a second sample 
was selected for interviewing about details of income and expenditures. 
This sample was designed to provide adequate numbers of families 
in certain relatively rare categories without taking more families in the 
more prevalent categories than were needed for comparisons between 
the various types and kinds. The sampling procedure was complicated 
by the necessity of interviewing in such a manner that if the work were 
terminated at any stage in the process families that had been vMted 
would still be a representative sample. This project far exceeded any 
previous study of its kind in size and complexity. The procedmes 
and problems have been described rather fully in the reports and else¬ 
where. They follow a basic plan that was prepared by the Social Science 
Research Council in 1929 [40]. 

(4) The National Health Survey was conducted by the TJ. S. Public 
Health Service in 1935-36. It covered 83 cities in 18 states with samplii^ 
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ratios varying from 1 in 38 for New York City to complete canvass of 
the smaller places. The sampling was done by dividing each city into 
small areas of less than 1000 persons each using the 1930 Census 
enumerators’ maps as the basis, and then drawing a systematic sample 
of these small districts for complete canvass. The Survey supplied a 
great amount of data on illness and medical care. Its schedules were also 
tabulated in great detail by the Social Security Board to obtain cross 
classified information about family composition and characteristics for 
actuarial studies. 

(5) The Michigan Census of Population and Unemployment was 
taken early in 1935 by the State Emergency Welfare Relief Commission 
as a WPA work relief project [41]. The sampling procedure involved 
complete enumeration of all cities between 3000 and 40,000 inhabit¬ 
ants and a 20 per cent random sample of smaller places and rural 
townships. Cities of more than 40,000 inhabitants were sampled by 
taking all dwellings with certain predesignated house numbers accord¬ 
ing to a scheme developed by S. A. Stouffer. The reports not only pro¬ 
vided detailed data on unemployment in various parts of the state, but 
also unusual information on migration that was subsequently analyzed 
by John N. Webb and Albert Westefeld [42]. 

(6) The Minnesota Income Study was a work-relief project spon¬ 
sored by the Minnesota Resources Commission and operated in 1938 
and 1939 using a complete system of stratified sampling with a sys¬ 
tematic selection of households in urban areas and a random selection 
of two-square-mile areas in rural districts [43]. 

(7) The New York City Youth Survey was made by the Welfare 
Council of New York City in 1935 taking every hundredth household 
from the lists of the 1934 Real Property Inventory supplemented by a 
similar sample of dwellings built after the Inventory [44]. It was also 
a WPA work relief project. All members of these households between 
16 and 24 years of age were interviewed about their leisure activities, 
employment, schooling, and other personal data. 

(8) The Continuous Work History Sample is a 4 per cent sample of 
workers covered by old age insurance maintained by the Social Se¬ 
curity Board in connection with its records of earnings and contribu¬ 
tions to the insurance scheme [45]. Beginning with 1937 all reported 
earnings and related information are compiled for this group of 
workers. They were selected by use of certain combinations of digits 
in their accoimt numbers. The sample provides annually distributions 
by age, sex, race and taxable wages that are needed for actuarial pur¬ 
poses as well as useful for general economic studies. 
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(9) The 1937 Enumerative Check Census of 'Crnemplo 3 rment was 
conducted by the Director of the National Unemployment Census 
who had been ordered by an act of Congress to undertake a voluntary 
census of the unemployed by mail [46]. The Check Census was designed 
to determine what proportion of the unemployed had actually regis~ 
tered and thereby provide more accurate estimates of the volume of 
imemployment than had been available previously. The sample was a 
systematic selection of all households in 2 per cent of the postal carrier 
routes in the U.S.A. About 18 per cent of the population, not served by 
carrier delivery, was excluded but was taken into accoimt in the es- 
timates. The 510,000 households in the sample were visited by postal 
carriers to obtain data of employment and imemployment and a special 
count was made of the number of workers and unemployed persons 
for the routes that constituted the sample. From these data it was 
detei'mincd that 71 per cent of the unemployed had registered and es¬ 
timates were prepared for separate classes of workers with correspond¬ 
ing calculations of the sampling errors of the estimates. The analysis 
of the estimates and their errors was especially noteworthy. 

The foregoing examples exhibit progress in utilizing sampling tech¬ 
nique for statistical surveys of exceedingly important national prob¬ 
lems. They reflect the trend toward greater use of detailed statistical 
data and research as a basis for determining policies and establishing 
programs on a city and state wide basis as well as nationally. They 
are impressive lor the size of the samples taken, but there were also 
many smaller samples of comparable importance. In two ways, they 
were a result of the depression: (a) in the urgent demand for statistic^ 
data generated by the emergency and by the New Deal programs, and 
(b) in the funds and relief personnel that were made available. How¬ 
ever, these influences tended toward haste and inadequate prepara¬ 
tion, towoi'd the use of loss eflicient sampling schemes in order to meet 
problems of supervision and the restrictions on expenditures for travel 
by interviewers, and toward typos of inquiries that were suitable for 
the kinds of interviewers that were available. Problems of cost and 
economical operation were considered but were affected by the primary 
purpose of providing useful work for unemployed white collar workers. 
This tended to discourage intensive analysis or research studies not of 
a relatively routine character since the amount of emplo 3 rment provided 
in relation to the kind and amount of supervision required was smaller 
for such projects. Finally sampling problems received only secondary 
emphasis because the other problems connected with the surveys were 
much more serious and time consuming. The results of these survesrs 



AMBBICAN STATISTICAL ASSOCIATION 


were of very great value and to a large extent this was due to the use 
of sampling procedures. 

Coupled with the direct effect of depression problems in stimulating 
the use of sampling was the trend of thinking among statisticians and 
social scientists who, during the late 20^s and early 30’s, became in¬ 
creasingly interested in applying error theory to the analysis of time 
series and other data [47]. This trend of interest was well represented 
among many of the men who participated in the work of the Commit¬ 
tee on Government Statistics, and recommended in 1933-35 greater 
and better use of sampling procedures in government statistical work 
for such purposes as (1) checking the coverage, accuracy, and complete¬ 
ness of censuses, (2) testing the representativeness of current reporting 
systems (crops, cost of living, payrolls, employment, production, etc.), 
(3) supplementing the regular schedules in censuses to relieve them of 
overloading, (4) obtaining prompt national and regional figures on 
such subjects as imemployment, and (5) making very detailed intensive 
studies in which the emphasis is on the analysis of relationships [48]. 
The Committee also recommended that the staff of the Central Statis¬ 
tical Board and the Bureaus of Agricultural Economics, Census, and 
Labor Statistics include expert advisers on sampling proceduie whose 
formal professional training in the technical phases of sampling was 
comparable to that of chemists, physicists or biologists in the govern¬ 
ment service. These agencies did move to accomplish what had been 
recommended. They experimented with trial surveys of unemploy¬ 
ment, agriculture, constiuction, retail prices, and basic studies of the 
problems involved in developing practical sampling systems [49]. 

By this time sampling practice had outgrown the simpler methods of 
random and systematic sampling and was developing complex sampling 
systems specially designed to fit the nature of the population that was 
being sampled, the costs and administrative factors, and the principles 
of eflBicient design that had evolved in scientific experimentation and 
industrial production. 

V. EXPERIMENTAL COMPARISON OP METHODS AND DEVELOPMENT OF 
COMPLEX SAMPLING PROCEDURES 

Modem sampling practice utilizes a variety of devices and methods 
which have been tested experimentally and incorporated into sampling 
theory. The aim of modem practice is to select those methods which, 
when combined in the most appropriate manner, will constitute a 
sampling system that is economical, convenient, and accurate in the 
situation for which it was designed. Among the devices available for 
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the construction of such a sampling system are: (a) the technique of 
selecting items at random, (b) the determination of the kind of sampling 
unit that will have the most desirable properties, (c) subdivision or 
stratification of the population, in an advantageous manner, (d) the 
use of variable sampling proportions, (e) subsampling and multistage 
sampling in which the sample is itself sampled in turn, (f) drawing two 
or more units from each ultimate subdivision to permit estimation of 
the sampling error from their differences, (g) use of information pro¬ 
vided by variables that are correlated with the one that is being stud¬ 
ied, and many other procedures. 

It is difiSiCuit to find the original source of many of these techniques 
but it is clear that they were, to a large degree, crystallizations of no¬ 
tions suggested by common sense, customary practices, and practical 
problems encountered in sampling. Discovery of their importance and 
success in combining them into efiicient sampling schemes has required 
a clear understanding of their properties and a considerable amount of 
ingenuity. Beyond that, careful experimentation has been necessary 
to test their relative merits vmder various conditions. Thdr develop¬ 
ment has occurred primarily in connection with agricultural experi¬ 
mentation, mass production industry, and large-scale surveys. 

(a) AgricuUural experimentation: One of the first steps in the ex¬ 
perimental comparison of sampling schemes was the study of hetero- 
gendty in the fertility of a field and the consequent correlation of 
adjacent portions. In 1910 and 1911, very important work was re¬ 
ported by “Student,” Wood and Stratton, and Mercer and Hall 
[50], leading to the conclusions that in field trials there is always some 
experimental error, that it can be reduced by taking a large number of 
small plots and, that, in the comparison of two varieties or treatments, 
it can be reduced by taking these plots in pairs, adjacent to each other, 
and u^g one plot out of each pair for each variety or treatment. The 
latter device was adopted so as to take advantage of the correlation 
between adjacent areas. Plots of different sizes and shapes were com¬ 
pared u^g the results of a carefril harvesting and measurement of man¬ 
golds and data on 400 reports of duplicate plots reported by other in¬ 
vestigators. These results led to similar experiments in England, Amer¬ 
ica, and elsewhere [51]. Various systematic sampling schemes were 
proposed and tried experimentally, both for locating plots and sub¬ 
sampling the plots in order to r^uce the labor of harvesting them 
completely. In 1929, Clapham and Wishart published the results of 
experimental comparisons between certain random and systematic 
methods of sampl^ a field of potatoes or cereals and analyzed the 
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factors that contributed to the error of the estimate of yield [52]. In 
the same year Smith and Prentice published results of a field study in 
which they took samples of soil and then subsampled for laboratory 
examination. In their analysis of the results they estimated the increase 
in error due to subsampling [53]. R. A. Fisher, whose advice and 
methods of statistical analysis had been followed in many of the experi¬ 
ments in the 20's and later, developed complex systems of experimen¬ 
tation of great efficiency and revolutionized experimentation [54]. 
Yates and Zacopanay published in 1935 a thoroughgoing analysis of the 
efficiency of sampling a field, including comparative labor costs [55]. 
A number of manuals were published expounding the principles and 
practice of field experimentation. This work of the agronomists and 
statisticians had direct applications to large-scale surveys. 

(b) Mass production industry: Another source of complex sampling 
procedures was the development of inspection and quality control in 
mass production. In 1923, engineers at the Western Electric Com¬ 
pany, manufacturers of telephone equipment, began to apply prob¬ 
ability theory to the inspection of telephone exchange equipment. In 
the refinement of manufacturing processes and progressive adoption of 
more rigorous specifications, the cost of inspection had become very 
burdensome and sampling procedures offered possibilities of reducing 
the amount of inspection without serious loss of control over the 
quality of the product. Different sampling schemes were compared on 
the basis of their probability-of-acceptance curves and tolerances were 
established for the proportion of defective pieces in a lot. Thereafter 
sampling procedures were developed in more complex forms to achieve 
a reduction in the amount of inspection to the minimum necessary 
under the conditions of manufacture and the specifications to be met 
[56]. Technical problems more or less peculiar to the telephone industry 
made it the pioneer in the applications of mathematical statistics to 
industrial processes, including not only inspection but quality control 
and probability analysis of operating problems imder Rorty (as early 
as 1903), Shewhart, Dodge, Molina, Fry, and others [57]. 

(c) Cornmunication of new techniques: Those developments in agri¬ 
culture and engineering had both direct and indirect effects on sampling 
survey practice. They provided principles of design and contributed to 
the growth of applied mathematical statistics. Still there were many 
practical problems and obstacles that delayed the immediate extension 
of the methods developed for field trials and manufacturing to large- 
scale surveys. One of the obstacles was the relative lack of communica¬ 
tion between statisticians engaged in different types of work. An im- 
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portant step toward facUitating communication was taken by the Royal 
Statistical Society in 1933 when it formed the Industrial and Agricul- 
tmal Research Section and published the proceedings of meetings of 
the Section as a Supplement to the Society’s Journal. The Section 
brought together mathematical statisticians, engineers, agriculturists 
and others interested in practical applications of newly developing sta¬ 
tistical theory and stimulated greatly interest in sampling. E. S. Pearson 
presented a notable paper on "Problems of Industrial Sampling” at a 
meeting of the Section early in 1934 [58]. A few months later, Jerzy 
Nesrman read before the Society a comprehensive theoretical and criti¬ 
cal paper on stratified sampling and purposive selection. He cited 
among several examples a survey of the working population in Poland 
for which he had designed a stratified random sample of districts [59]. 
A number of other papers by various authors, published from time to 
time in the Supplement, included discussions of sampling practice. 

In this period interest in sampling practice was heightened in Great 
Britain by Shewhart’s visit in 1932 (a factor in the formation of the 
Industrial and Agricultural Research Section) and in America by the 
visits of Fisher, Neyman, Yates, and Cochran (1936-38) as well as by 
American students who studied in England. 

(d) Large-scale surveys: Another root of complex sampling practice 
is to be found in the progressive improvement of large-scale surveys of 
crop yields and acreages, the labor force, and other economic and 
social facts. A pioneering survey in India in 1923 by J. A. Hubback 
went almost unnoticed until Mahalanobis discovered and republished 
it in 1946 [60]. Hubback undertook to estimate the yield of rice by ob¬ 
jective measurements in the fields instead of by the methods based on 
the judgment of farmers and brokers. He used as sampling units very 
small areas measured off by a wooden frame in every field his agents 
found being harvested as they walked a fixed route from each selected 
center. Hubback was acquainted with “Student’s” and Bowley’s sam¬ 
pling work. Ilis report emphasized the advantage of random selection 
of many very small areas as economical and necessary to obtain a valid 
estimate of error. 

In the period 1928-31, J. 0. Irwin examined the British crop es¬ 
timating system and made recommendations for improving the 
sampling methods [61]. Subsequent work was done by Cochran. In the 
United States, W. F. Callander, J. B. Shepard, C. F. Sarle and others 
began to plan in the late 20’s for a sample of farms to be used for various 
studies of acreage, production, livestock, etc. Sarle made compreh^- 
sive apprmsals of the methods used to obtain farm-price data in 1927 
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and crop-yield estimates in 1932 and proposed a program of studies 
aimed at testing and improving the various sampling operations then 
in use [62]. Several studies of this type were made by state agricultural 
statisticians. 

In July, 1936, the Bureau of Agricultural Economics and Iowa State 
College held a conference on sampling attended by many of the state 
agricultural statisticians as well as staff from Washington and Ames 
[63]. R. A. Fisher who was lecturing at Ames participated in the discus¬ 
sions. The conference set forth a great variety of practical problems 
and outlined specific projects for research to help solve them. Some of 
these probjects were started at Ames in cooperation with the Buieau, 
some were undertaken in Washington. The program has been con¬ 
tinued up to the present time with additions and modifications. It has 
resulted in a remarkable series of repoits and developments of meth¬ 
odology [64] as well as the establishment of a sampling staff within the 
Iowa State College Statistical Laboratory at Ames under G. W. 
Snedecor, A. J. King, and R. J. lessen. 

During the same period, P. C. Mahalanobis and his associates at the 
Indian Statistical Institute in Calcutta developed large-scale sampling 
methods to a high degree of technical perfection and utilized them for 
surveys of jute acreage, agricultural finance, famine conditions, radio 
listening, and many other subiects. The theoietical system associated 
with these practical applications has been set forth fully by Mahala¬ 
nobis in his publications and lectures [65]. Mahalanobis has made careful 
experimental measurements of the human errors in sampling surveys 
by use of “interpenetrating samples” and other means. 

From 1936 on the Research Division of the U. S. Bureau of the 
Census xmder C. L. Dedrick made extensive studies of problems in 
population sampling, utilizing the 1930 Census returns. The results 
have not been published. In 1939, Stephan, Doming, and Hansen de¬ 
signed a sampling scheme that was used in the 1940 Population Census 
to draw a 5 per cent sample totalling 7 million people. This method was 
based on experimental tabulations of 1930 census data and on a varia¬ 
tion of systematic sampling [66]. The sample was used to obtain pre¬ 
liminary estimates of regular census statistics as well as tabulations of 
the special questions that were asked only of the people in the sample. 
Subsequent complete tabulations verified the accuracy of the estimates. 

During the same period experiments were conducted in a number of 
cities and towns hy the Research Division of the Work Projects Ad¬ 
ministration, under H. B. Myers and J. N. Webb, to test out methods of 
conducting sample surveys of unemployment. On the basis of these 



HISTORY 07 UODEBN SAMPUNO 


33 


trials a monthly nation-wide survey was started in December 1939 
utilizing a subsampling procedure in approximately 60 counties. Its 
scope was enlarged during the war to include houdng and labor ques¬ 
tions and in 1942 it was transferred to the Census Bureau where it was 
developed further as the Monthly Survey of the Labor Force [67]. 

"With the Monthly Survey and other sampling surveys required by 
the war agencies, the Census Bureau had a sizeable program of sampling 
work and developed, under Morris H. Hansen, a staff of sampling spe¬ 
cialists who designed sample schemes, and studied their performance in 
practice. Data from the 1940 Census for very small areas were made 
available for use in sampling and Sanborn maps showing every street 
and building in a large part of the cities in the United States were ob¬ 
tained [68]. In addition, airplane maps were obtained for a large 
part of the United States and procedures were developed for employing 
them in sampling. 

A further step was taken in the development of sampling facilities 
when the Bureau of Agricultural Economics established at Iowa State 
College in 1943 a project to design and prepare a Master Sample of 
farms for use in its surve 3 rs [69]. This Master Sample is a stratified se¬ 
lection of very small areas based on airplane photographs and hi^way 
maps. The Census Bureau joined in its development and had it ex¬ 
tended to include other than farm areas. 

These most recent developments in complex sampling procedures 
have been described very inadequately and many other recent develop¬ 
ments have been omitted for want of space. Other speakers on the pro¬ 
gram of the International Statistical Conferences will present the cur¬ 
rent stage of theory and examples of present day practice. 

VI. DBVELOFMBNT OF TTSB OF JUDOICENT SELECTION AND 
CONTROLS IN SAMPUNO 

All attempts to obtain data by sampling involve some choices based 
on the purposes of the study, knowledge of the material, and experi¬ 
ence with the methods of observing and recording similar data in previ¬ 
ous studies. For example Bowley’s choice of Reading for his survey in 
1912 and Clapham’s choice of certain fields at Rothamsted for his ex¬ 
periments on sampling design were not random selections. Yet the con¬ 
clusions drawn from such samples as these are commonly generalized 
to apply to other periods of time and other places. Some investigators 
have depended entirely on their knowledge and judgment to make the 
final selection of the items that constitute the sample, others have in- 
msted that judgment should only be used to subdivide the population 
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into strata and units of sampling, the actual choice of the sample being 
made at random. These views have been presented in various forms 
and the differences have not yet been fully resolved. Research is in 
progress to determine some of the facts and develop the principles 
that are involved in appraising the performance of various sampling 
procedures based on these conflicting views. 

Jensen, in his report in 1926, described a number of surveys made by 
“purposive” sampling in which districts were selected which, taken to¬ 
gether, matched the population in certain characteristics known from a 
previous census. The theory of purposive selection was developed by 
Bowley in 1926, and discussed by Jensen in 1928 [70]. Neyman com¬ 
pared purposive and stratified random sampling in 1934 giving special 
attention to experience reported by Gini and Galvani. Mangus de¬ 
scribed a variety of purposive sampling of counties used in studies of 
the FERA in 1934 [71]. 

The development of public opinion polling and market research in¬ 
volved many interesting problems of social psychology and survey 
procedure that fall outside the scope of this paper, but it also led to the 
use of many different sampling schemes, prominent among which were 
those termed “quota control” sampling. The earliest work was done 
with samples chosen to meet the convenience and very limited resources 
of the research personnel. In this respect they were like most other sur¬ 
veys conducted at the time. Robinson has reviewed in detail the history 
of straw votes and opinion polls to 1932 [72]. He also analyzed The Lit- 
erary Digest poll which, in spite of its great size was badly biased by its 
method of mail balloting. Several analysts were able to improve The 
Literary Digest results by relatively simple adjustments, [73] but, in 
field surveys in which interviewers are left free to use their judgment in 
selecting a representative group of respondents, a better procedure 
appeared to be to regulate the selection of the sample by assigning to 
each interviewer a quota of persons of various kinds so as to obtain the 
correct proportions of men and women, old and young, rich and poor, 
etc. in each city or district that was canvassed. This procedure was de¬ 
veloped by Cherington, Roper, Gallup, Crossley, whose sampling sur¬ 
veys of opinion became widely known after the election of 1936, and by 
market research and opinion survey organizations, whose studies re¬ 
ceived less publicity [74]. The general principles of these and related 
methods have been discussed by Brown, Franzen and Robinson, by 
Cassady, and in two symposia [75]. Quota methods have been em¬ 
ployed in the surveys of the National Opinion Research Center [76], 
the OflBlce of War Information and the British Wartime Social Survey 
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[77]. Most of the opinion polls and market research studies around the 
world at present employ quota methods or related procedures of sam¬ 
pling. These methods have been adopted partly by imitation of the 
more widely known surveys and partly because they are attractive with 
respect to costs and practical convenience. 

VII. GENERAL VIEW OP THE PRESENT PERIOD 

Current uses of sampling will be described by other speakers at this 
session. For the most part these uses were connected with the war 
which proved to be an even greater accelerating stimulus than the de¬ 
pression of the 30's. They included surveys of employment, housing, 
prices, shortages of consumer goods, dealer inventories, public reaction 
to wartime measures, characteristics of selectees, the attitudes of sol¬ 
diers, effects of bombing in England, Germany and Japan, public opin¬ 
ion in many countries, inspection of election registers in Greece, surveys 
of saving and spending, employee attitudes, reading and radio listening, 
social and psychological factors affecting fertility in the U. S. and in 
Britain, etc. In manufacturing and procurement, sampling methods 
were perfected for quality control and inspection of materials, parts, 
and finished products. Market research and opinion studies by private 
research agencies and business concerns developed rapidly, with con¬ 
siderable emphasis on measuring public relations and employee atti¬ 
tudes. The volume of work increased and many continuing or repetitive 
surveys were established, making it possible to spread the costs and 
hence devote greater resources to the design of each sample than had 
been possible for infrequent sampling surveys. 

Other speakers will describe the more recent developments that 
grew out of these earlier accomplishments. The Journal of the Royal 
Statistical Society and its Supplement, the Journal of the American 
Statistical Association, the Annals of Mathematical Statistics and many 
of the scientific and professional publications in various fields have pub¬ 
lished the increasingly numerous articles that reflect the progress of 
applied sampling. The reader should review them, beginning in 1940 or 
earlier, and search articles that report particular studies as well as those 
that are principally devoted to discussions of method. 

Great as these advances have been, they are only the forerunners of 
a broad expansion of sampling in the future. Further improvements in 
method are also to be expected. Much experimental work is being done 
within and outside government bureaus. The Committee on the Meas¬ 
urement of Opinion, Attitudes and Consumer Wants, appointed jointly 
by the National Research Council and the Social Science Research 
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Council, is making a comprehensive study of sampling methods as 
they are used in its field. The United Nations Statistical Commission 
has established a sub-commission on sampling which will contribute 
greatly to the development of sampling surveys in many countries. 
The International Statistical Institute can perform a great function in 
extending the use of sampling procedures in the many statistical ac¬ 
tivities throughout which it exerts its leadership and influence. 
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THE USE OF MARK SENSING IN A LARGE SCALE 
TESTING PROGRAM* 


Walter L. Deemeb, Jr. 

AAF School of Aviation Medicine^ Randolph Fields Texas 

A description is given of the use of mark sensing in the 
Psychological Program of the Army Air Forces. The prin¬ 
ciples of mark sensing are summarized and the mark sensing 
cards of the Psychological Program are exhibited. Some gen¬ 
eral principles are given for use in deciding on when to use 
mark sensing in a testing program. 

I T IS the purpose of this paper to describe the experience of the Psy¬ 
chological Program of the Army Air Forces in the use of mark 
sensing, so that this experience may be utilized by others who may, 
have problems similar to those we encountered during the war. 

There are five sections to the paper: 1) a brief history of the testing 
program in which mark sensing was used; 2) a statement of the prob¬ 
lems which we attempted to solve by mark sensing; 3) a description of 
mark sensing; 4) the plan that we devised; and 5) an evaluation of the 
plan with suggestions for future uses of mark sensing in large scale 
testing programs. 

During the early stages of the war, the testing of aircrew candidates 
for classification as Bombardier, Navigator, or Pilot was accomplished 
at three centers, one located at Nashville, Tennessee, one at San An¬ 
tonio, Texas, and one at Santa Ana, California. A central records 
agency was established at Ft. Worth, Texas, at Headquarters, AAF 
Training Command. At the testing centers there was administered to 
each candidate a battery consisting of twelve to sixteen paper and pen¬ 
cil tests, and five or six psychomotor tests. For a list of the tests admin¬ 
istered see Appendix A. The paper and pencil tests were administered to 
groups of 100 to 300 candidates; the psychomotor tests were adminis¬ 
tered to groups of four candidates, typically on the day following the 
group tests. Each psychomotor test required fifteen minutes to admin¬ 
ister. For purposes of administrative control, and for research it was 
desired to have recorded on punched cards in Ft. Worth all the scores of 
every candidate tested and also certain supplementary information, 
some of it for identification, such as name and army serial number; 

* This papa was oiigixially presented at the Educational Research Forum, Endioott, N. Y., 
August 28,1947. It reiuesents the ojnnion of the authoronlyand ie not to be conetniedaBrepreBentins 
the official ojnnion of the War Deportment or of the Army Air Forees. 
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some of it biographical, such as age, marital status, education and 
previous flying training; some of it for classification purposes, such as 
preference for types of trmning, status, phyacal qualifications, recom¬ 
mended classification, and final classification. 

In brief, then, the problem was to devise the most efficient and ac¬ 
curate means of getting all the above data on punched cards in Ft. 
Worth. The obvious method, of course, was to have the necessary 
rosters prepared at the testing centers and to pimch cards from these 
rosters. On the basis of descriptions of mark sensing, howev^, it was 
decided that this method m^ht be more efficient than rosters. Ef¬ 
ficiency was important because of the large testing load. An average of 
over 14,000 men per month was tested during the last half of 1942. 

Before describing the plan that was devised, it might be well here to 
summarize biiefly how mark sensing works. Figure 1 shows Group Tost 
Mark Sensing Card No. 1. The particular use of this card in the testing 
program will be described later. It will be described now as an example 
of mark sensing in generaL 

The mark sensing portion of this card begins at column 16. Each of 
the small ovals with numbers in the center is a mark sensing position 
and covers three ordinary punching columns. There are 22 mark sensing 
columns on this card. (A total of 27 mark sensing columns is the maxi¬ 
mum that can be put on a card.) There are twelve mark sensing posi¬ 
tions in each column. The ovals are immediately above the correspond¬ 
ing pimch portions. In use, the ovals are marked with a horizontal line 
by a special pencil. The cards are processed on the 513 Reproducing 
Punch, to which a mark sen^g unit has been attached. The mark sens¬ 
ing brushes consist of sets of three regular brushes. In each set of three 
brushes the outer brushes are common and the center brush is con¬ 
nected to a mark sensing brush hub on the plugboard. A mark is sensed 
when it bridges the gap between the center brush and either outer 
brush in that group. The marks are sensed conductively rather than 
photo-dectrically. It is necessary, however, to amplify the current 
through an amplifier unit using vacuum tubes as relays. 

The unit is completely flexible in that each mark sen^g column may 
be punched in any column on the card. The card in Figure 1, for exam¬ 
ple, has been laid out for the first two mark sensing columns to be 
punched in columns 8 and 9. 

The speed of mark senmng is the standard reproducer speed of 100 
cards per minute. Ten mark sensing columns can be sensed at one lime 
by one mark senmng unit. Completely to process the card in Mgure 1 
requires three runs throu^ a reproducer with one mark sensiag unit. 
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In order to verify that a mark has been sensed and punched, use is 
made of the double punch and blank column detection device, which 
automatically indicates a card which has a double punched column or a 
blank column. Since the blank column detection caimot be used with¬ 
out the double punch detection, verification in mark sensing is lim¬ 
ited to columns which arc designed for angle punching only, unless one 
of the double punches occurs in the 11 or 12 positions, and the other 
occurs in the 0-9 positions, in which case column splits may be used. 
The double pimch detection device does not detect double marks on 
the cards; it detects double punching of the cards. 

The original plan for applying mark sensing in The Psychological 
Program was devised by ColonelJohn C. Flanagan, who worked out a 
detailed plan for psychomotor testing. Each candidate was to be ^ven 
a different card lor each psj'-chomotor test. When he entered the room 
for the test he gave his card for that lest to the examiner, who on 
completion of the test marked the test scores on the card and filed it. 
The cards for this were planned and printed very early in the program, 
before it was known definitely just how many psychomotor tests would 
be used in the battery. Ten cards were prepared for each candidate in 
order to be sme to have enough. These were card forms like that in 
Figure 2, which is the number 9 card. 

The first five columns of each card contain an identifying number 
called the testing number. In column six is punched the number of the 
card in the set. The four mark sensing fields with a total of 8 mark 
sensing colmnns were lor the psychomotor test scores, which for each 
test consisted of a total raw score, a standard score, and one or two part 
scores. The four mark sensing fields are in the same position on aU ten 
cards. Each card must therefore be processed by itself, that is, all the 
munber 1 cards were mark sensed and the standard scores for test 1, 
marked in field A, were punched in columns 37 and 38. For test 2, field 
A was punched in columns 39 and 40, etc. The marks in the other 
fields were punched in other columns. After each set of cords was 
mark sensed all cards were sorted together on columns 1-6 and se¬ 
quenced on the collator, thus putting all cards of each man together 
and in order by test. They were then put through the reproducer and 
by interspersed master gang punching all the scores were combined in 
the 9-card. The 9-card thus became a summary card. The other cards 
were stored in semi-dead files, as reference to them was necessary only 
for verification of scores. All anal 3 r 8 es were performed on the summary 
card. When less than ten psychomotor tests were administered we used 
less than ten cards, but the 9-card was always the summary card. 
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Figure 3 shows a revised individxial test mark sensing card which 
allowed more space for recording scores. 

It was necessary to make complete plans for reporting scores before 
any actual scores were recorded in Ft. Worth, and in fact before the 
extensive testing program was started at the testing centers. After a 
good deal of discussion with IBM representatives, it was decided that 
it would be feasible to secure aU the test scores and miscellaneous other 
information by mark sensing. Accordingly two cards were designed for 
paper and pencil test scores. The first of these is that shown in Figure 
1 , and the other is Figure 4. There is room on these two cards for 17 
two-di^t scores. These cards have an advantage over the first psychom¬ 
otor test card m that the x and y podtions are available for marking 
scores that are unknown, making it possible to insist on a mark in every 
column, so that the blank column checking device could be used ef¬ 
ficiently. Each card could be mark sensed in three runs throu^ the 
reproducer. Card 1 was reproduced into card 2, making card 2 the sum¬ 
mary card. The fields numbered 1 to 17 were for test scores. The three 
two-digit fields labeled Bomb., Nav., Pilot, were for two-di^t composite 
scores giving measures of aptitude for each type of trmning. These 
compodte scores were based on weighted averages of paper and pencil 
tests and psychomotor fapparatus) tests. For ease of use, these two- 
digit scores were converted to single digit scores ruiming from 1 to 9. 
These single di^t scores, called stanines, were entered in the three fields 
near the right edge of card 2. The last column in card 2 had entered in it 
the amount of credit received by the candidate for previous flying 
training he may have had. The actual process involved in entering these 
scores will be described more fully later. 

The other two cards used in the testing program are the name card 
and the shd> card. The name card is shown in Figure 5. The first edition 
of the stub card is in Figure 6. The second edition is in Figure 7. 

The name card and the stub card were presented to the candidate 
at the first testing sesdon. They were connected by a small stub and 
both were pre-numbered and pre-punched with the testing number. 
Testing numbers w^ allotted in blocks to the testing centers, and 
•vrece assigned in chronological order to the cadets, so that a numerical 
order by testing number was also in order by testing center and by date 
of testing. The assigning of numbers was automatic since the candidate 
wrote his name on the name card and the stub and that number was 
thus asagned to him. His testing number was pre-printed and pre¬ 
punched on all his other mark sensing cards, both paper and pendl test 
cards and psydiomotor test cards. A pre-punched, pre-printed card 
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appears in Figure 2. The other cards shown in the figures are samples 
without the pre-printing and pre-pimching. 

The name card (Figure 5) did not require any mark sensing. The ca¬ 
det filled it in by writing the date, his name, army serial number and 
date of birth, and by encircling the appropriate entries for marital 
status, education, and previous flying experience. This card was de¬ 
signed for self coding; that is, the pimch operator started punching in 
column 41, with serial number, after date of testing was duplicated in 
columns 37-40 from a master card. As the card went through the punch 
the rest of the information became visible. 

The stub was tom off by the candidate after filling in name and 
army serial niunber, and was retained by him as a record of his testing 
number, which he wrote on all his group test answer sheets. Prior to the 
psychomotor test session the candidate turned in his stub and received 
his set of psychomotor test cards with the same testing number. The 
mark sensing information on the right of the stub card was filled in at 
the testing center after the candidate bad been tested and classific<l. 

All cards of the candidate were sent to Ft. Worth as they were com¬ 
pleted. The name cards were a source of information on the number of 
people who had started testing. The stub card information was some¬ 
times not available for some weeks or even months after the candidate 
was tested. The name cards for such candidates were kept in a special 
file and permitted a rapid count of incomplete processing. 

The mark sensing part of the stub card was completed at the testing 
center as the necessary information became available. The first’'edition 
of this card (IBlgure 6) was quite complex and illustrates a great many 
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of the pitfalls to be avoided in mark sensing. The first mark sensing field 
gave physical qualification. The Prev class space at the top of the first 
field was marked if the candidate had previously been classified else¬ 
where. The column of letters beginning with B to the right of the first 
field gave the labels to be attached to the rows of positions all the way 
to the right hand mar^: B for Bombardier, N for Navigator, P for 
Pilot, G for Ground, and the other letters for possibilities which might 
arise. Actually there did arise other classifications, such as flight en¬ 
gineer and gunners. The four coliimns in the field headed JND repre¬ 
sented the first to fourth preference of the individual for each type of 
training. For example, an individual whose first choice was Pilot, sec¬ 
ond NaAigator, third Bombardier and fourth Groimd would have had 
his card marked opposite the P in the first column, opposite the N in 
the second column, opposite the B in the thii'd column, and opposite 
the G in the fourth column. The rows are the aircrew positions and the 
columns arc the choices from 1 to 4. In the next field, headed P.R. U., 
wci’e given the four recommendations of the testing center (Psycho¬ 
logical Research Unit); the final classification by the Classification 
Board was entered in the field headed Boaid. 

The top throe rows contained special codes not related to those in 
rows 1 to 9. The row headed SP showed the strength of preference for 
the individual preferences. That is, it indicated how much importance 
the candidate wanted attached to his stated preferences. 

The line above that, headed Elim, represented tvpes of training from 
which the candidate may have already been eliminated. Many men who 
had been eliminated from Pilot training, for example, were tested for 
possible entry into Bombardier or Navigator training. The last nine 
columns in this row were not used. The Y row headed Prdb repre¬ 
sented any training which was recommended or assigned probationally. 

It will be seen that double punching on this card occurred in the four 
columns headed IND, and triple punching in tliis field was possible. 
Thus if the machine were wired normally, there would bo a stop on 
every card since at least one of the four columns would be double 
punched. The split column device could not be used to separate the 
punches because it splits between 11 and 0, and the double punch 
found here consisted of 0 and a punch from 1-9. Consequently, the 
double punch detection had to be omitted from this field. Since the 
double punch blank column detection device on the reproducer will not 
operate for blank columns without operating for double punches it was 
not possible to use dther of these checks. This turned out to be a very 
serious difficulty. 
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Mark sen^g is subject to two sources of error, one in the marking 
and one in the procesdng in the reproducer. In the stub card we were 
getting at least our share of errors from both sources. 

The directions for marking the stub card were prepared very care¬ 
fully, but even after some months of use we occasionally found errors in 
TOftrlfing due to failure to follow directions; this is not surprising in view 
of the complexity of the card layout. 

By far the largest number of errors in marking the card were due to 
carelessness, however, rather than to misimderstanding. We foxmd 
almost every conceivable type of error: coluions not marked at all, too 
many marks, imposdble entries, fields transposed. 

We found, too, that even correctly marked cards fairly often came 
out with missing punches or too many pimches. 

Because we could not use the blank column double ptmch detection 
device on the reproducer we had to find the errors by tedious methods, 
such as running lists on the tabulator for ^dsual checks. This was far 
from satisfactory either for accuracy or speed. 

We also experienced difficulty in procesdng many stub cards be¬ 
cause of their bad condition due to handling. It wiU be remembered 
that the candidate kept the card for one day. We had prepared a crease 
for folding the card so that it could be carried easily in a pocket. The 
creasing alone did not cause any great processing trouble. But the com¬ 
bination of creasing and handling and carrying in pockets by candidates 
often did mutilate the card so that it was difficult to process. We had 
further processing difficulties because of the serrated edge when the 
card had been detached from the stub. Unless constantly watched and 
checked the punches from the mark sensing would be slightly out of 
position. This would eventually cause trouble in the reproducer and 
collator. 

After a few months of use we revised the stub card. The revised 
card is shown in Figure 7. In the new edition all double punching and 
all blank columns have been eliminated. That is, each column can be 
marked correctly in only one place, and for every candidate there will 
be information for every column. 

One reason for revising the stub card was an administrative change in 
reporting preferences. The candidate was asked to show his strength of 
interest in each aircrew position on a one to nine scale, instead of rank¬ 
ing his preferences from one to four. Nine positions were therefore 
necessary for each category (B, N, P, other), making necessary a 
change in the card dimensions used for the positions. That is, where in 
the first card we had B, N, P as rows, we now have them as columns. 
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We could not change the dimendons of PRU and Board Classif cations 
because of the extra data which we had to insert. Hence we were left 
with three fields—^individxial preference, PRU recommendation, and 
Board Classification—all of which should lo^cally have mrcrew posi¬ 
tion as either rows or columns, but which actually had different di¬ 
mensions. This caused us some difficulty in preparing rosters for out- 
ade use. 

The cards on which the psychomotor test scores and the paper and 
pencil test scores were marked were much simpler than the stub card 
and caused less trouble in processing. Nor was there any apparent mis¬ 
understanding of directions for marking. Certain difficulties were en¬ 
countered with these cards, however. At first the accuracy of enteiing 
scores on mark sensing cards was a good deal less than the accuracy 
of entering scores on a roster. This was discovered by having two 
crews alternate on both methods of entering. The accuracy was im¬ 
proved by careful supervision, but all testing centers reported a very 
definite dislike on the part of the clerks for enteiing scores on mark 
sensing catds and this probably contributed to inaccuracy. 

It should be noted that the personnel who were doing the recording 
were well trained and highly competent. As a matter of fact, many of 
the clerks were enlisted men who were probably much more inteUigent 
(and moie interested in their work) than the usual clerks who would or¬ 
dinarily be used for this work. Perhaps the high level of intelligence (as 
shown in part by high Army General Classification Test scores) con¬ 
tributed to the dissatisfaction felt with the tedious process of marking 
the cards. 

We encountered some informal objection to mark senting from one 
or two senior officers who stated that a permanent original record of 
scores should not consist of marked cards but should consist of rosters. 
In the case of paper and pencil tests, this objection was unfounded, 
since the original record was the IBM answer sheet. In the case of the 
psychomotor tests, however, there was some merit in the objection, 
since according to the plan for use of the mark sensing cards no other 
record of the score would be kept. This objection was never com¬ 
municated formally. 

This objection to the use of mark sensing cards in psychomotor test¬ 
ing was reinforced by the need for keeping records for research pur¬ 
poses of many part scores on the psychomotor tests. There was no 
room on the mark sensing card for these scores, and furthermore the 
testing unit wanted a record of the part scores for its own use. After a 
short while, therefore, we found that what could have been a real labor 
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saving method had it been usable as planned, was becoming a burden 
because it was being superimposed on the old method; that is, score 
sheets for each psychomotor test were being prepared at the time of 
the test and the mark sensing cards were being filled in later. 

This duplication was soon evident in all the mark sensing operations. 
For local administrative purposes rosters of the infoimation on the stub 
card were needed sooner than they could be furnished from Ft. Worth. 
The result was that the examining centers were making rosters of the 
same information they were putting on the mark sensing cards. Since 
the centers were having difficulty in securing sufficient clerical help, tho 
extra labor of making mark sensing cards was a distinct burden. 

Certain difficulties were encountered with mark sensing in Ft. 
Worth. One difficulty was in storing the cards which were not used as 
summary cards. These consisted of one paper and pencil test card and 
nine psychomotor test cards, a total of ten cards per man. These cards 
were the ori ginal records which had to be referred to in case the scores 
on a summary card were questioned. They therrforc not only had to be 
preserved, but had to be readily available because many questionable 
scores were found on summary cards as they were processed. There was 
thus a large job of filing these cards. Furthermore, reference to these 
cards was tedious because of the bulk. Finding the score on a certain 
test for a certain individual was a rather long operation after thousands 
of men had been tested. 

The problem of original records had further ramifications. When a 
card was noispimched as it was mark sensed, either because of inac¬ 
curate marking or machine malfunction, a connet card was made. But 
the mispunched card had to be saved since it was an original record. 
There developed therefore an extensive file of “spoiled originals.’’ 
When a search of the main file showed a remade card, it was necessary 
to search the file of spoiled originals for the original card. 

Eventually, after a very thorough tryout, mark sensing was aban¬ 
doned, principally because of the difficulties at the testing units which 
have been outlined. 

The testing program which we were operating was dynamic. The 
test battery was changed two or three tunes a year, and other changes 
such as method of reporting preferences had to be made. The kinds of 
supplementary information which was needed changed from time to 
time. We foimd that new mark sensing cards incorporating the changes 
took a good deal of time to print. It was difficult to anticipate changes 
in the testing battery in time to have new cards prepared. 
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A summary of the factors which we found to be important in the 
use of mark sensing is given below: 

First, and probably most important from the operational side, is 
complexity of layout. E a complex layout is imavoidable it should be a 
warning signal against mark sensing. Complexity of layout may con¬ 
sist either in complex codes which lead to difficulty in the marking, or 
in double punches and blank columns which lead to difficulty in the 
processing. 

Secondly, mark sensing is contra-indicated when a permanent 
original record is necessary. There are two reasons for this: first, the 
cards are more bulky to file than typed rosters, and, second, the cards 
are hard to establish as original records in cases where scores must be 
verified for agencies outside the testing program. 

A third point to consider is whether a record in other form than a 
mark sensing card is needed before the cards can be processed and a 
roster prepared by machine. If such a record is needed it may be more 
economical to use this record for manual punching. 

A foiuiih point is the time necessary for revi^g cards in the case of a 
dynamic program. K changes are going to be at all frequent, a mark 
sensing s^'stem may be a liability, since a change can be effected no 
faster than new cards can be designed and printed. 

I want to emphasize and make explicit a point that has been implicit 
throughout the preceding discussion, namely, that our final decision 
to give up mark sensing is not a refection on the value of mark sensing 
vhen it is used under conditions for which it is suitable and for which it 
has been designed. I have felt only that by presenting our experiences 
it would be possible to delineate more exactly some conditions for which 
mark sensing is not suitable. 


APPENDIX A 

Tests used in the Aviation Psychology Program of the AAF for the 
selection and classification of Aviation Cadets. 

For a complete description of the tests and of the testing program 
see the Research Reports of the Army Air Forces Psychological Pro¬ 
gram, U. S. Government Printing Office, 1947. 

Paper and Pencil Tests 

1. Technical Vocabulary 

2. Speed of Identification 

3. Mechanical Principles 
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4. Mechanical Infarmation 

5. Mathematics 

6 . Arithmetic Reasoning 

7. Numerical Operations 

8 . Spatial Orientation 

a. Matching aerial photographs with other aerial photographs 

b. Matching aerial photographs with maps 

9. Reading Comprehenrion 

10. Dial and Table Reading 

Psychomotor Tests 

1 . Complex Coordination 

2 . Discrimination Reaction Time 

3. Two Hand Coordination 

4. Rotary Pursuit 

5. Aiming Stress 

6 . Finger Dexterity 

7. Rudder Control 



A PROPOSED BASIC COURSE IN STATISTICS 

George W. Snbdecor 
Professor of Statistics, Iowa State College 


INTRODT7CTION 


A BASIC introductory course in statistics was advocated by the Na¬ 
tional Research Council Committee on Applied Mathematical 
Statistics in its stimulating report of May, 1947, Reprint and Circular 
Series, Number 128. The functions of such a course are stated as fol¬ 
lows: “First, it should form part of a general education and as such it 
should be self-contained. Secondly, it should provide essential training 
for students majoring in the natural and social sciences, which may be 
developed further in later courses. Finally, it should interest promising 
young students in statistics as a profession.” 

Concerning laboratory work in statistics, the Report contains these 
penetrating remarks: “Often the main objective of such laboratories is 
the calculation of means, variances, correlation coeflScients and other 
statistical quantities from numerical data of various types. The labora¬ 
tory work should place more emphasis on interpreting or drawing in¬ 
ferences from data and on the nature of those inferences. Simple ex¬ 
periments should be devised for illustrating probability laws of various 
kinds, for carrying out sampling operations and other random processes. 
The traditional flipping of coins and rolling of dice are not adequate for 
illustrating many important random processes. The mathematical 
theory of many of these processes is too complicated to be handled at 
an elementary level; their experimental demonstration will give the 
beginning student some feeling for their significance.” 

The writer finds himself in entire agreement with this report of the 
Committee on Applied Mathematical Statistics. He finds, also, that 
many other teachers realize the need for revision in the traditional 
methods of presenting the subject. These statisticians are not only 
striving to reorient their own ideas but are experimenting with new 
methods of teaching. It seems timely to propose a concrete program 
which may serve as a springboard for discussions. 

Two merging trends emphasize the necessity for changes in the teach¬ 
ing of statistics. One trend is the astonishing growth in the impact of 
statistics on society. For many years after the American Statistical 
Association was founded in 1839, popular knowledge of the subject was 
confined almost entirely to governmental statistics and the data of 
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economics, though insurance was beginning to enter its present domi¬ 
nating position in our social structure. Today statistics has captured 
the popular fancy. The extensive data on the sports pages, the various 
opinion polls, popularity tests of many kinds including those of radio 
programs, the cost-of-living indexes with their repercussions on wage 
policies, surveys of consumer preference, crop estimates, quality con¬ 
trol—^these are some outstanding examples of the preoccupation of 
people with statistical evidence. 

The second trend merging with the first is the no less astonishing 
growth of statistical theory. Estimation and the testing of hypotheses 
have clothed with living flesh the dry bones of numerical data. The 
emphasis on sample-to-population inferences has put new meaning into 
statistical terminology. The Fisherian concept of “information” has 
shifted interest from the formal rules of calculation and summarization 
to the vital processes of getting information into the data by use of ap¬ 
propriate sampling and experimental designs. 

The mer^ng of these two trends into a single stream makes new de¬ 
mands on our statistical personnel. Neither the isolated theorist nor 
the submerged practitioner is able to keep abreast of the current. The 
specialist in mathematical statistics must acquaint himself with practi¬ 
cal problems, and equally, the work-a-day statistician must familiarize 
himself with the logic of statistical theory: fortunately, this is being 
made more readily available to those who do not have the benefit of 
mathematical symbolism. It is only by the intermixture of the two 
streams that statistics can freely flow onward. 

The fact should not be overlooked that the recent upsurge of interest 
in statistics is based on an ancient pattern of thinking, a form that has 
developed along with the thought process itself and that is more primi¬ 
tive and more extensive than the logical forms. Concepts of type and 
of departure from type are counterparts of statistical averages of 
location (means, regression coeflScients, etc.) and of measures of scale 
such as the interquartile interval and standard deviation. The process 
of sampling is deeply embedded in human actions. Judgments about 
probability together with consequent behavior determine much of the 
pattern of our daily lives. Anyone entering the profession of statistics, 
especially the teaching of it, should enjoy the confidence that he is 
engaging in a fundamental activity of mankind. 

Since statistics is so intimate a part of our social organization, the 
conclusion is inevitable that it should be taught generally to our young 
people. As indicated in the National Research Council Committee 
report, the elements will doubtless be introduced into the high school 
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curriculum as soon as there is a sufficient supply of trained teachers. 
Meanwhile, the basic course, taught at the freshman or sophomore 
level, should serve as the foundation of college curricula to produce such 
teachers along with other professional statisticians. 

Essentially, my idea is to bring the student into awareness of and 
harmony with the statistical content of our society. This content is 
extensive. It includes gossip, news and probability theory; sports and 
old age pensions; gambling and weather prediction; birth rates and 
living costs; the stock market and epidemics. Such large-scale ac¬ 
tivities as insurance and the census must be integrated with the more 
academic concepts of probability, distributions, sampling, estimates 
and tests of hypotheses. 

In working out the following syllabus, the author has set himself 
the ideal of presenting sound statistics in an interesting fashion. It 
would seem inadvisable to introduce so vital a subject in an austere 
and forbidding style. On the other hand, one must be constantly on 
guard lest he produce false impressions that will later have to be 
tediously eradicated. Only the elements of statistical thinking should 
be incorporated in this basic course, but the elements should constitute 
strate^caUy chosen timbers that will fit readily into the projected 
structure. 

It is a deep-seated conviction of the author that the tools of statistics 
should be brought out and sharpened only after the need for them has 
been felt. Tables, graphs, and calculations of the various averages have 
all too often been presented as the subject matter of statistics rather 
than as its implements. The very yotmg people whom we wish to enlist, 
those with imagination and intellectual enthumasms, may be alienated 
by dull routine if introduced before any necessity for it becomes evi¬ 
dent. 

It is clear that no more than an outline of a basic course can be sug¬ 
gested now. The outline will be changed and filled in by the efforts of 
many teachers who will gain experience from trying the experiment. 
Some have already started; the important thing now is to get more 
people working at the job. 

PROPOSED OTTTUKE OP COURSE 

I. Introduction. The interest people show in counting and measuring. 
Some historical items showing the antiquity of the habit. Statistical 
form of much human thinking. Uncritical attitude of people toward 
numerical statements. Illustrate fallacies. Interesting and uninteresting 
statistics. Illustrative material: batting averages, birth rates, election 
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returns, opinion polls, vital statistics, the average man. 

II. Inquiry by Sampling. Propose a contract to find out for a radio 
station the number of listeners to Program X. Develop limitations in 
extent of inquiry and in number of units interviewed. Discuss design 
of sample, including size. Quota sampling, random sampling, area 
methods. Use of mail and telephone. 

Assume sampling completed and ballots counted. Contrast known 
fraction of listeners in sample with unknown fraction in the population. 
Take confidence interval from table and explain meaning. Elxpand to 
population total number of listeners, using census information about 
area sampled. 

This sampling problem has been introduced ahead of the chapter on 
a priori probability for three reasons: (i) It is the newer and more prac¬ 
tical concept; (ii) It leads inamediately into a modern social problem 
instead of into the more ancient and perhaps less honorable one of 
games of chance; (iii) The term population has its obvious and funda¬ 
mental meaning, sample-to-population inferences bmng inevitable. 
Experience has convinced me that the student feels himself on familiar 
ground. 

Laboratory: Conduct opinion poll sampling of student body; con¬ 
struct questionnaire, design sample, discuss interview technique, tabu¬ 
late results, make iiderences about population. Students enjoy this ex¬ 
perience, ask many penetrating questions and open opportunities for 
sound instruction. 

III. Random Sampling from Population wiOi Known Constitution. 
Use bowl containing equal numbers of beads of two colors, or use table 
of random numbers with equal numbers of odd and even digits. Draw 
many samples of 10, recording numbers of “successes.” Contrast sam¬ 
ple ratios with that of population. Set confidence interval from each 
sample and determine proportion of correct statements made. Compare 
results with theory. Emphatize sampling variation and observe im¬ 
proved reliability when samples are combined into larger samples. 
Compare with poll of radio listeners where parameter is unknown. 

Laboratory: Extend sampling experience by use of coins or dice. 
Record result of each toss for later use. 

IV. Frequency Distribidion. Tabulate numbers of samples with 0 to 
10 successes. Present distribution graphically. The mode as an estimate. 
The small fraction of extreme values as an explanation of confidence in 
sampling. The mean number of successes in the aggregate of each stu¬ 
dent’s sampling. Emphasize the greater reliability and utility of the 
mean as compared with the mode. Combine samples of 10 into larger 
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groups and observe (i) the greater variation in the number of successes 
and (ii) the lesser variation in the proportion of successes. Law of Large 
Numbers. 

V. Insurance. Next to government, the greatest cooperative social 
enterprise in America. Historical items. The premium, the expectation 
of loss, is the price of protection equally enjoyed by all participants, 
life insurance and mortality tables. Expectation of life. Emphasize 
definite statements about uncertain events. Pure premium for single 
year term insurance. Level premiums for term and life policies. In¬ 
surance vs. savings accounts. 

Laboratory: Calculating machines will be required to compute vari¬ 
ous premiums. 

Insurance is now a social problem of great importance. The soundness 
of the trend away from pure insurance which is predominantly statisti¬ 
cal, towards various forms of savings devices which are mainly financial, 
is questionable. The student should be able clearly to distinguish be¬ 
tween insurance and investment. 

VI. A Priori Probability I: Games of Chance. A primitive, universal 
human interest. Probability and expectation. Conditions of fair play. 
Effects of limits on resources and time—sample vs. population. Playing 
against The House. Systems of play. Probabilities of runs. 

Laboratory: Try a system like the Martingale, noting winners and 
losers at end of specified number of throws. Balance accounts in the 
aggregate. Work out probabilities in some game. 

This is one of the chapters that affords great opportunity for ex- 
poting popular fallacies. 

VII. A Priori PrbbcibiUiy II: The Binomial Distribution. Develop 
first with probability of J. Compare sample distributions with theo¬ 
retical. Raise question of testing hypotheses. Develop distribution with 
probability different from f. Skewed distributions. Mean and mode. 

Laboratory: Throw dice to get data for samples for asymmetrical 
binomial. Compare samples with population. Save data for testing in 
next chapter. 

VIII. Empirical Probability. Prevalence of vague judgments and ac¬ 
tions based on them. Sampling basis of most calculated probability. 
Sampling from populations with unknown probability. Illustrate by 
thro\ving a loaded die. Calculate chi-square for hypothecs of perfect 
balance. Get distribution of chi-square from samples of 10. Test of 
hypothesis. Meaning and use of table of chi-square. Contrast samplings 
from populations with known and unknown probability. Emphasize 
the more practical problem of sampling from unknown probability 'tnth 
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resulting inferences and tests. Ordinarily the parameter is forever im- 
known, but pertinent hypotheses intrude themselves. 

Laboratory: Extend the experimental basis for the chi-square dis¬ 
tribution. Test an ample number of hypotheses about various samplings 
that have been made. Extend to more than one degree of freedom. Test 
the goodness of fit of samples from binomial distributions. 

This chapter is the climax of the first part of the course. A substantial 
body of statistical theory has been accumulated together with some 
practical problems and a number of socially advantageous applications. 
Sampling from specified populations has been emphasized. The bi¬ 
nomial distribution has been developed. A sampling distribution of chi- 
square has been built up, leading to confidence in the table. Estimates 
and tests of hypotheses have been justified and applied. Uncertain in¬ 
ference has been exemplified. If the student ends his contact with the 
course here he will have had experience with the fimdamcntal concepts 
of statistics. 

IX. Measurement. Start with guesses at the length of an 18-20 inch 
bar. Distribution of guessed lengths. Measure mth scale and sum¬ 
marize in distribution. Emphasize variation as characteristic of all meas¬ 
urement. Parameter is again always unknown. Develop idea of normal 
distribution as a model like the binomial. Properties of the normal dis¬ 
tribution. Mean and standard deviation of sample as appropriate 
estimates of parameters. Methods of calculation. Repeated measure¬ 
ments of the same thing compared to measurements of the members of 
a sample from a normal population. Emphasize conceptual character 
of models and impossibility of learning parameters by sampling. 

Laboratory: Let all measure height of one member of class. Distribu¬ 
tion of heights of men and women. Perform some psychological experi¬ 
ment leading to near-normal distribution. 

X. Sampling DishibvMam. Each member of class draws 10 or more 
samples of 10 from a normal population, using table of random digits 
for randomization. Mean and variance are calculated for each sample. 
Tabulate distribution of each, showing normality of first and skewness 
of second. Estimate population mean and variance from each. Calcu¬ 
late t for each sample, using known population mean. Distribution of 
t. Calculate confidence interval based on each sample and verify frac¬ 
tion of correct statements. 

Laboratory: Construct normal curve from parameters of sampled 
population. Pit normal distribution to distribution of means. Test 
normality of distribution of variance. 

XI. SampLing From Some Human PopvAoMon. If time and facilities 
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are available, this may be an actual sampling. Usually this chapter 
will be limited to tabulation, summarization and presentation of avail¬ 
able data from some sample survey. Calculate mean and variance. Test 
normality of distribution. Confidence intervals. Expand estimates to 
population totals. 

Laboratory: Most of this chapter is the laboratory type of work. 

XII. Non-Normal DislribvMons. Rectangular and skewed. The me¬ 
dian as an estimate. Sample distributions of mean and median. Em¬ 
phasize that no known actual distribution is normal. Use of median and 
rdated order statistics. 

Laboratory: Draw complete set of samples from some small rectan¬ 
gular populations showing central tendency of means. 

XIII. Regression. Growth curves and economic trends. Calculation 
of linear regression on several assumptions. Deviations of individuals 
from trend; case studies. Estimates, confidence statements and tests of 
hypotheses. 

Laboratory: Get results of aptitude tests and college grades. Es¬ 
timate latter from former. Each student calculates his own deviation. 

XIV. Index Numbers. Construct a simple cost-of-living index with 
direct economic meaning—^the changing cost of a specific bill of goods. 
Emphasize necessity of examining computation before attaching mean¬ 
ing to an index. 

Laboratory: Construct cost-of-student-livii]^ index in your college. 
Continue from year to year. 

XY. Correlation in normal bivariate population. Estimates. Confi¬ 
dence interval and test of h 3 rpothesis p=0. Mental tests. Inheiited 
characteristics. The variance of differences. Correlation and regression. 
Rank correlation in samples from non-normal populations. 

Laboratory: Construct sampling distribution of r in small samples. 

XVI. Sampling from More Oian One Population; e.g., men’s heights 
and women’s. Combining estimates—^under what circumstances is it 
appropriate? Stratified sampling. Estimates of mean and variance— 
wei^ts. Analysis of variance in groups. 

Laboratory: Draw samples and verify various estimates. Distribu¬ 
tion of F. 

The foregoing chapters constitute a course occupying two quarters. 
V^th omission of chapters V, XI, XIV, and the latter parts of some 
other chapters, a semester’s course is available. For a third quarter, 
one or more of the remaining topics may be expanded to suit special 
groups; quality control for en^eers and industrial economists, assays 
for entomolo^sts, etc. 
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XVII. Statistical Instruments of the Federal and State Governments, 
The census with population studies. Public health and vital statistics. 
Marketing and other economic statistics. Crop estimates. Relief and 
security statistics. 

XVIII. Financial and Business Statistics. Markets and market rec¬ 
ords. Trends and forecasts. Consumer acceptance and preferences. 

XIX. Quality Control. Historical items. Control chart with statisti¬ 
cal features. Contracts involving control contrasted with those pro¬ 
viding inspection. Sequential test. 

XX. Assays. Governmental regulations. Pure food and drugs. In¬ 
secticides. Vitamins. Purity of seed. 

XXI. Experimentation. Science and experiments. Alternation of in¬ 
duction and experiment. Control of extraneous variation. Groups and 
“randomized blocks.” Analysis of variance. Statistical control—co- 
variance. Broadening the basis of inference—factorial design. Inter¬ 
action. 

XXII. The Statistical Attitude. Group vs. individual. The usual vs. 
the exceptional. News vs. everyday life. Detection of fallacies. Prob¬ 
ability vs. certainty. Evidence vs. proof. 

The mathematical accompaniment of such a course can be adapted 
to the training of the students. It may vary from simple algebraic 
derivations of formulas with practice in the notation to a full course 
in mathematical statistics. Since the logical concepts of statistical the¬ 
ory can be presented and verified experimentally, without the symbol¬ 
ism of mathematics, it is the author’s opinion that the mathematical 
formulation should follow some more general presentation such as that 
outlined in this syllabus. 

On writers of texts for this course will rest the heavy responsibility 
of making their appeal directly to the student. At present there are not 
enough teachers to go around. An instructor’s manual containing back¬ 
ground material (such, for example, as the data from a consumer’s 
preference survey) would seem to be a necessity, since this kind of 
source material is not available to many. In fact, such a manual may 
well be expanded into a book on the teaching of elementary statistics. 



ACTUARIAL ESTIMATES FOR PUBLIC SICKNESS 
INSURANCE PLANS 


Abraham M. Nirssen* 

TJ, 8. Railroad Retirement Board 

Proponents of social insurance consider sickness insurance a 
logical extension of the unemployment compensation pro¬ 
gram. Two state plans and one Federal sickness insurance plan 
are now in operation, and a number of other states are giving 
serious consideration to this matter. 

This paper explores the possibility of making cost estimates 
for public sickness insurance plans along established actuarial 
lines. The use of the actuarial approach may be of importance 
when long-range estimates are desired. 

Appropriate actuarial techniques are presented for a typical 
public sickness insurance plan. The method is theoretically 
simple, and does not require the use of statistics which are not 
ordinarily available. The details would depend on the pro¬ 
visions of the plan and on the purpose of the cost calculations. 
The general method is sufficiently flexible to be adoptable in 
various situations. 

I N RECENT YEARS a number of states have considered the introduc¬ 
tion of sickness insurance plans for workers covered under the un¬ 
employment compensation schemes. Two states, namely Rhode Island 
and California, actually enacted laws to this effect. A similar develop¬ 
ment took place with respect to railroad workers for whom a nation¬ 
wide sickness insurance program was established by the amendments 
enacted in 1946. The methods used in connection with this type of in¬ 
surance naturally fall in two categories depending upon the purposes 
at hand. For short term cost estimates, methods commonly employed 
by statisticians and economists may be advisable. However, for long 
range estimates actuarial methods would seem preferable. 

To the knowledge of the author no actuarial technique has as yet 
been developed for long range cost estimates in connection with large 
public sickness insurance schemes. This paper constitutes an attempt 
to develop new techniques along lines similar to those used in pension 
fund work. It is believed that the method here presented is in principle 
simple and that it can be readily applied without recourse to any spe¬ 
cial statistics that are not usually available. It is of course recognized 
that at the present time none of the public insurance plans has been in 

* The opinions expressed in this paper are those of the author and do not necessarily represent the 
official views of the Railroad Retirement Board. 
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existence long enough to furnish experience upon which reliable ac¬ 
tuarial estimates could be made. Nevertheless, an early discussion of 
the problem may lead to an exchange of ideas which would result in the 
establishment of a uniform method for the valuation of plans of this 
tsrpe. It is in the spirit of starting off such a discussion that this paper 
has been written. 


GENEBAL CONStDEEATIONS 

Sickness rates are known to depend on age, race, sex, occupation, 
geographical location, employment opportunities, and to some extent 
also on the duration of coverage. Other factors to be considered are 
variations by calendar year and secular trends. There is considerable 
difference between making estimates for a "going” system as compared 
with a new one, since the tickness experience of different programs will 
generally be affected by the relation of benefits to wages, administra¬ 
tive procedures, and by the existence of other related social insurance 
programs. From a short range viewpoint most or all of these factors 
are important. However, when a long range cost estimate is attempted 
it is practical to consider only the most important factors. It is be¬ 
lieved that a sufficiently reliable estimate may be obtained by consid¬ 
ering only variations by age, sex, and race. This approach would seem 
particularly justifiable in the case of a compulsory state-wide plan pro¬ 
vided that experience has been available for a period long enough to 
be considered representative. Even preliminary estimates can be de¬ 
veloped by using actuarial methods. In such a case, the basic rates and 
averages would have to be estimated on the baris of available richness 
rates and previous estimates made in connection with the operations 
of the unemployment compensation program. 

The main advantsges of the method presented in this paper lie in 
the fact that the calculations can be performed with a ininiTnum of 
statistics, and that provirion will be made for the effect of reserves. It 
is recognized, of course, that a sickness insurance plan in which the 
ri^t to benefits terminates not later than two years after the year of 
last employment does not have to cope with the problem of deferred 
benefits. To this extent, the question of reserves is of much smaller 
rignificance than in a retirement plan. However, even in public sick¬ 
ness insurance plans reserves are necessary in order to prevent frequent 
changes in the level of contributions, and to provide a safety margin for 
contingendes. 

The mathmatical development presented in this paper generally 
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follows along the lines of actuarial techniques established for the valu¬ 
ation of large self-insured pension funds. Because of the different nature 
of the problems involved many significant modifications had to be 
made. In pension fund work it is customary to consider not only the 
age of the employees but also the duration of their service. This is 
taken care of by the introduction of actuarial factors (such as with¬ 
drawal rates, wage scales etc.) which depend on both age at entry and 
duration. It is usually convenient to consider the effect of the duration 
for only a few years which are then said to constitute the select period. 
All longer durations are then combined and the corresponding calcula¬ 
tions proceed by attained age only. The length of the select period is a 
matter which has to be determined on the basis of preliminary studies. 
It is not certain that the duration of covered employment will have a 
considerable effect on the experience of a sickness insurance program. 
It was felt, however, that a theoretical discussion of actuarial tech¬ 
niques for such programs should indicate the procedures which would 
have to be followed in case it is decided to make allowances for the 
effect of the duration of substantial employment. For this reason, it 
was deemed advisable to introduce a select period of one year. It would 
obviously be an easy matter to modify the mathematics so as to elim¬ 
inate the select period altogether, or to introduce a select period of 
longer duration. 

In order to avoid trivial detail the assiunption was made that the 
reader is familiar with tabulating procedures and that he is acquainted 
with the rudiments of general actuarial theory. For the benefit of the 
reader who might wish to refresh his memory on some particular topic 
a few references have been appended at the end of the paper. Reference 
(e) has been included mainly because the author of this paper is in¬ 
debted to Mr. Rusam for a few ideas regarding the construction of 
actuarial functions for sickness insurance calculations. 

Since it was desired to present a general outline of the proposed 
method, the convenient assumption was made that the only break¬ 
downs will be those by age and duration. If other breakdowns, as by 
race, sex, and occupation, are deemed necessary, the method would 
have to undergo some slight modifications especially in the part dealing 
with the level cost calculations. 

DESCBIPnON OF THE FLAN 

The sickness insurance plan under discussion is assumed to have the 
following basic characteristics. 

1. Participation is compulsory on the part of all workers covered 
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under the state unemplo 3 nnent compensation program. No 
other individuals are admitted. 

2. The plan is of the monopolistic type, that is, no substitution 
of private group insurance is permitted. 

3. Contributions are in the form of a level percentage of taxable 
wages A^ithout differentiation by age, sex, race, or occupation. 

4. No experience rating is anticipated. 

5. The plan is to be financed on an actuarial basis including the 
maintenance of proper reserves. 

6. The only requirement for insurability is the earning of a 
stated minimum of wages in a given calendar year referred to as 
the “base year." This stated minimum of wages is thereafter in the 
text referred to as “qualifying wages.” 

7- Workers insured at the end of a given base year are eligible 
for benefits in a succeeding one-j^ear period referred to as the 
“benefit year.” Under the Rhode Island plan, for instance, the 
base year is a calendar year and the benefit year is a 12-month 
period beginning in April of the next calendar year. Throughout 
the benefit year the individual benefit rate remains constant 
depending on the worker’s earnings in the base year or in a 
specified portion thereof. 

8. There is an initial waiting period in each benefit year. Bene¬ 
fits are paid on a weekly or daily basis up for a specified maximum 
duration. 

9. The plan pays cash benefits only when there is a loss of wages 
as a result of sickncas. However, the benefit rate depends only 
on the total creditable wages in the base year. No medical ex¬ 
penses or death benefits are included. 

10. The amoxmt of benefit does not depend on the worker’s 
marital or parental status. In other words, there are no family al¬ 
lowances. It is also assumed that the benefit payments have a 
uniform duration for every qualified worker. 

STATISTICS RBQXnRED 

It is assumed that with respect to each worker the following data 
will be available. 

1. Social security account number. 

2. Sex, race, and occupation. 

3. Year of birth. 

4. Taxable wages in the base year with special provisions made 
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for showiiig quarterly earnings if the benefit is based on earnings in 
a specified quarter rather than in the whole year. 

5. Calendar year of experience (base year). 

6. Worker’s status as of the end of the preceding base year 
(insured, not insured). 

7. Worker’s status as of the end of the base year under con¬ 
sideration (apparently insured, not insured because of failure to 
earn qualifying wages). 

The above is the minimum information to be punched in the wage 
card. In addition, a claim card is to be maintained with respect to each 
worker who received benefits in the benefit year corresponding to the 
base year. This card should contain at least items 1, 2, 3, and 5 of 
the wage card and in addition 

(a) The benefit rate, and 

(b) The total of benefits received during the benefit year. 

It is believed that such information as listed above, although inade¬ 
quate for a proper anal 3 rsis of the sickness experience, will be sufficient 
for an actuarial estimate along the lines developed in the following 
sections of the paper. It should be noted that in addition to the analy¬ 
sis of the available statistics it will be necessary to prepare a long range 
estimate of taxable pay rolls by calendar year. 


DEFIHITrONS AND NOTATIONS 

Throughout the reminder of the paper the following definitions and 
notations will be used: 

n Calendar year corresponding to the base year imder 

consideration. 

X Mean age in the base year, equals n minus year of 

birth. 

Number of workers bom in n—x who earned qualifying 
wages in the two base years n—1 and n. For con¬ 
venience, this group will be referred to as "insured 
permanent employees.” 

Number of new entrants in the base year n at mean age 
X. Denotes workers bom in n—x who earned quali¬ 
fying wages the base year n but not in »—1. 

Number among the E^ who worked in the base year 
n-f-1 but earned less than the minimuTn qualifying 
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Similar to except that it refers to the Nl 

workers. 

(rE)^^ Number among the El who had no wages in the base 
year n+l. 

(rN)'^^ Number among the Nl who had no wages in the base 
year «+l. 

(SE)l Total taxable wages in n earned by the El workers. 

(SN)l Total taxable wages in n earned by the Nl workers. 

(iSO)" Total of wages in n earned by all workers who failed to 
earn in that year qualifying wages. 

P" Total taxable pay roll in n. 

iBE)l Total sickness benefits paid during the benefit year 
corresponding to the base year n with respect to the 
El workers. 

(BN)l Total sickness benefits paid during the benefit year cor¬ 
responding to the base year n with respect to the 
Nl workers. 


COMPUTATION OP RATES AND AVERAGES 

It is assumed that the statistics described above are available 
with respect to a period of several base years, say, from c to d. The 
symbol 2^ denotes summation from n=c to n=d. B^efit data ob- 

n 

viously need be available for the benefit years corresponding to these 
base years, which will thus extend the period for which statistics must 
be available for a year and a fraction beyond the calendar year d, while 
at the same time the benefit data for the first year and a part of the 
next yeax of the period will be ignored. The total probabilities of loss 
of insured status, gj, for new entrants and permanent employees 
respectively are: 

Z[(v>n)T +mT] 

n 

L [iwE)T + {rE)T ] 

g. (la) 
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The averugc taxable wages, «*, by age in the base year, again with a 
select period of one year, are: 




n 



n 


E (SE): 


n 



n 


( 2 ) 


(2a) 


The average benefit payments in the benefit year, 6*, per individual 
age X in tlio base year who coxild be eligible for benefits with respect to 
that base year, are then computed by means of the following formulas: 




( 3 ) 


E ml 



n 


(3a) 


A set of r* s, giving the proportionate age distribution of new entrants, 
is obtained from the formula: 


sew; 


( 4 ) 


Finally, a rulio g, which represents the portion of the total taxable pay 
roll which gives no rise to any benefit pasunents, is computed from: 


E(-S0)» 

n 


( 5 ) 


THE SERVICE TABLE AND ACTUARIAL FUNCTIONS 

All functions fl) through (5) can eaaly be obtained from proper 
tabulations. The rates and the averages may first be computed for quin¬ 
quennial age groups. The values by single age can then be obtained by 
one of the several standard interpolation methods. 
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At this juncture it becomes necessary to construct a service table. 
In the case of a pension fund, the service table would take separate 
account of exits from service by reason of death, withdrawal, dis¬ 
ability, and retirement. If we denote the tabular number in service at 
age a: by 5 the ‘in service survivors’ a year later or would be ob¬ 
tained by subtracting from ^ all exits between ages x and x+l as 
computed on this base. As an illustration, let us assinne that ^ 
=3 100,000 and that the total of the probabilities of exit is .2. Then, 
the computed number of exits between ag^ 25 and 26 is 20,000, and 
^=80,000. For the purpose of the development presented in this 
paper a special definition of exit from service was introduced. Here, an 
individual is considered to have left covered emplosrment if for any 
reason whatsoever he failed to earn qualifying wages in a ^ven base 
year. The combined probabilities of such exit are defined by equations 
(1) and (la). Our particular service table is then constructed as 
follows: 

Be^ with a selected young age, say, 15. 

Put luB, = 100,000 
Then, lu = li«i(l — ?[i6i) 
l/r = luH - ql) 


Cx = t'(i - q”) 

The select section of the service table is then computed by the for¬ 
mula: 




Cl 


1 — 

’'[*1 


( 7 ) 


It win be noted that from the way the table is constructed we have 
the following relationships: 


C 


’*+1 


[«] 

Cl 


Those among the insured new entrants iV^ who also 
earned qualifying wages in the base year n + 1 

9 

Those among the insured permanent employees who 
also earned qualified wages in the base year n -1-1 


E» 


( 8 ) 


•(8a) 
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We now compute the following actuarial functions: 

Dm = and = v^l', (9) 

where » = 1/1+*, i being the interest rate used in the calculation. 

W. » Z>;&. + + • • • + D'jb^ (10) 

where a denotes the limiting age which in practical situations will not 
be more than 80. 


Wl., ^D^bM + 

(10a) 

*A r,] = V” - 

1*1 

Ixl 

w. 

M* = - 

D' 

(11) 

(lla) 


In formulas (11) and (lla), m is the period (in years) between the 
end of the base year and the middle of the corresponding benefit year. 

The ‘A* represent the average lump-sum present value of all future 
benefit payments bused on wages in the base year just ended and on 
future base years during which the individual will earn qualifying 
wages continuously from the present year on. 

LEVEL COST CALCXJLATIONS 

Assume that a valuation of the sickness insurance plan is to be made 
as of the end of the calendar year n. On the valuation date there will be, 
say, insured permanent employees, and iVH insured new entrants. 
In addition, benefits may still bo payable for the remainder of the bene- 
year with rospcct to the and groups. Lot us furthermore 
assume that the balance in the account on an accrual basis on the valua¬ 
tion date, that is the funds on hand plus any additional income due 
but not yet received less disbursements obligated but not yet paid, is 
R\ 

The present value of the liabilities with respect to persons insured on 
the valuation date may then be computed as follows: 

With respect to the insured permanent employees 

( 12 ) 

With respect to the insured new entrants JVH 


( 13 ) 
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With respect to the workers eligible for benefits in the remaining 
portion of the current year 

= vi^-hY. + J\rr'6w) / (14) 

X 

where h denotes the fraction of the remaining current benefit year, and 
/ is a modifying factor to allow for the seasonal variation in disburse¬ 
ments throughout the benefit year. 

Total Li = -h i" -I- Lo. (15) 

At this junction it becomes necessary to compute pay roll functions 
as follows: 

Present value of 1 per cent of all future taxable pay rolls 

CO 

p = .01(1 + ly E (16) 

L-l 

Here j is less than 1 and (1+i)’ is designed to make allowance for the 
fact that the contributions on the average come in sooner than at the 
end of the calendar year. Thus, for instance, if contributions with 
respect to a ^ven calendar quarter are collected at the end of the suc¬ 
ceeding quarter, j may be taken as J. The individual present value 
•P« for an insured worker age ® of 1 per cent of future creditable wages 
taken in account only as long as the individual continues to earn quali¬ 
fying wages in consecutive years beginning from the present time is 
obtaiaed from the formula: 


«—X ^ 

•P, - .01(1 + ip - (17) 

The present value on the valuation date of 1 per cent of such future 
wages for all employees insured by reason of having qualif 3 dng wages in 
the last base year thus becomes: 

Pi= E(i?: + JV;)-*P.. (18) 

X 


The individual preset value as of the end of the year of entry for fu¬ 
ture entrants, otherwise dmilar to ‘P, is: 


•P[.i = .01 


(■Dw'Sw +jDin**P»fi) 


D 


1*1 


(19) 
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The theoretical adjustment for interest on the component is 

negligible and can be omitted. 

For a group of, say, 10,000 future entrants we have the present 
value of their benefits (following them only as long as they continue to 
cam qualifying wages in successive years) 

L'= 10,000Dr. »Aizi (20) 

X 

and the present value of 1 per cent of th<mr creditable wages during the 
period of continued qualifying wages is 

Pi >= 10,000 Er.‘P m- (21) 

X 

Thus, the level cost of benefits to future entrants as a per cent of their 
pay roll is 


L' 



The final cost calculations are then as follows: 


Item 


Symbol Formula 


A. Balance in the account 

B. Liabilities with respect to workers with qualifying 
wages in either the last base-year or in the previous 
one 

C. Present value of 1 per cent of future taxable pay rolls 

D. Present value of 1 per cent of future pay rolls with 
respect to workers who will fail to earn qualifying 
wages 

E. Present value of 1 per cent of future pay rolls for in¬ 
sured employees taken only so long as they continue 
to earn qualifying wages, considering only workers 
with qualifying wages in the last base year 

P. Present value of 1 per cent of pay roll of future 
entrants so long as they will continue to earn quali¬ 
fying wages, (C-D-E) 

G- Normal level percentage cost for future entrants 

II. Value of benefits for future entrants (item G times 
item F) 

I. Total net liabilities (B+BT—A) 

J. Net level cost on the valuation date as a per cent of 
the taxable pay roll 

K. Loading for administrative expenses and con¬ 
tingencies (per cent of taxable pay roll) 

L. Total cost 


Li (16) 

P (16) 

gP ( 5 ) 

Pi (18) 

P2 

t ( 22 ) 

X> 

L 

L/P 

L/P+e 
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CONCLUDING REMARKS 

1. The formulas presented before are related to an estimate extend¬ 
ing into perpetuity. It is apparent that the same type of formulas can 
be set down for a calculation limited to a given number of years. Such 
a modification would not involve any special theoretical difiiculties. 

2. The service table has been developed without the use of death 
rates. This approach was taken because in practice reliable statistics 
on deaths in service would not be available. It should be pointed out 
that for all practical intents and purposes deaths have been taken into 
consideration since in the development of the probabilities of loss of 
insurance the expresaons on the right hand side of equations (1) in¬ 
clude separations through death. In addition, deaths among indi¬ 
viduals insured in a given benefit year were also accounted for by using 
as the numerators of equations (2) the total sickness benefits actually 
paid. If the existence of applicable mortality statistics could be as¬ 
sumed, a modified service table could easily be constructed, although 
such a refinement would probably not be warranted. 

3. The level cost calculations proceed on the tacit assumption that 
the ratio g (formula 5) will remain constant throughout the years. 
Such will undoubtedly not be the case. However, the ratio g is bound 
to be small so that cyclical or other variations in its value will not have 
a significant effect on the results of the valuation. The annual wage 
data published by the Railroad Retirement Board (Compensation 
and Service of Railroad Employees) give some indication of the mag¬ 
nitude of this ratio. Thus, in 1945, less than 1 per cent of the total 
creditable compensation went to employees who earned in that year 
less than $150. 

4. The method makes no allowance for future changes in relation¬ 
ships between benefits according to age, individual wage levels, age dis¬ 
tribution of new entrants, and benefit provisions. In this respect, the 
procedure is similar to the one commonly used in pension fund work. 
Instead of trying to introduce the probable variations directly in the 
calculations, it would be much simpler to make two supplementary 
estimates according to so called “high* and “low” assumptions. Since 
cost estimates of this kind are subject to considerable errors regardless 
of the mathematical refinements introduced, the supplementary calcu¬ 
lations are in any event necessary for the establishment of a proper 
range for the cost figure. 



COST ESTIMATES FOR PUBLIC HEALTH INSURANCE 


73 


ACKNOWLEDGMENT 

The author is indebted to Mr. Joseph Musher, Actuary of the Rail¬ 
road Retirement Board, for many valuable suggestions and for his 
critical review of this paper. 


REFERENCES 

[a] Construction of service table: Life CorUingencies, by E. F. Spurgeon, Cam¬ 
bridge University Press, London, 1932 (Chapters I, II, and XXI). 

[b] Actuarial functions: U. S. Life Tables and Actuarial Tables 19$9-41t Bureau 
of the Census, 1946 (Part IV). 

[c] Graduation and interpolation: Elements of Graduation, by M. D. MiUer, 
The Actuarial Society of America and the American Institute of Actuaries, 
1946. 

[d] Valuation of pension funds: 

1. "On Staff Pension Funds,’* by George King, Journal of the Institute of 
Actuaries XXXIX, pp. 129-184. 

2. "Actuarial Valuations of the Assets and Liabilities under the Railroad 
Retirements Acts,” Annual Reports of the Railroad Retirement Board for the 
years 1940,1943, and 1946. 

[e] Mathematics of private sickness insurance: "Grundzugc der Mathematik der 
Privaten Krankheitskostenvcrsichorung,” by Friedrich Rusam, Transactions 
of the Twelfth International Congress of Actuaries, Lucerne, 1940, Vol. IV, pp. 
147-167. 



EARNINGS OF NONFARM EMPLOYEES IN THE U. S., 

1890-1946 

Stanley Lebebgott* 

MordreaL^ Canada 

Estimates of money and real earnings of nonfarm em¬ 
ployees are presented for the years 1890 to 1946, these es¬ 
timates reflecting the loss in earnings which arises from 
unemployment. Series are likewise given for full time equiv¬ 
alent earnings of nonfarm employees as a group and for each 
of the major industry groups for 1919-46, while the Bureau of 
Labor Statistics and the Bureau of the Census have developed 
estimates of unemployment for the period 1920-46. The 
sources of the rise in earnings after World War I are analyzed 
and it is concluded that about three-fourths of the rise in 
earnings derived from increased pay on a given job, while 
one-quarter originated in the changing distribution and char¬ 
acteristics of nonfarm employees. 

S INCE the monumental endeavors of Paul Douglas first provided us 
with estimates of wages for the period 1890-1926, a massive devel¬ 
opment in the basic materials for income estimation has taken place.^ 
The National Bureau of Economic Research and the Department of 
Commerce have between them provided estimates of full time earnings 
for the period 1919-1946,* and the Bureau of Labor Statistics has 
developed estimates of unemplojonent for the period 1920-1940.® The 
present study draws upon these materials and those of Douglas to 
provide estimates of; 

Money and real earnings of nonfarm employees, 1890-1946. 

Money earnings of employed workers by major industry groups, 
1919-1946. 

Attention is also given to the chief sources for the rise in earnings dur, 
ing the period following World War I.^ Section 1 discusses the generaj 

* The -news eaepressed in this article are those of the author and do not necessarily reflect those of 
any institution "with which he is connected. The writer wishes to acknowledge the assistance of Sophia 
Cooper in the preparation of Tables 5-7. 

1 Faul Douglas, Real Wages in the United States, iSGO-lQSe, 1930. 

* Simon Kuanets, National Income and Its Composition, 191»^19S8, 1941; Survey of Current Busi¬ 
ness, National Income Supplement, July 1947. 

* Frelumnary BLS estimates for 1929-40 appear in the writer's technical memorandum of July 
1945. The estimates for 1920-28 are on a comparable basis and will be released at a later date. 1940-1946 
data are from Bureau of the Census, Lobor Force, Employment and Unemployment in the United States, 
1940-1946, 1947. 

* Throughout this paper references are to employees in nonfarm industry. The term earnings refers 
to wages, salanes and supplements when used in connection with Tables 1 and 2 and the related text, 
but does not include supplements in other contexts. 
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trend of earnings and the sources of the rise in earnings; Section 2 is 
devoted to a discussion of full time earnings by industry; while Section 
3 deals with the actual derhation of the estimates. 

The limitations of the basic data used in this study have been dis¬ 
cussed in the sources from which they are drawn and are by now well 
known. But by way of caution it may be reiterated that estimates of 
unemployment, earnings and cost of living in the years prior to 1930 
are inevitably imperfect. The estimates presented in this paper par¬ 
take of these imperfections. Nevertheless such estimates are required 
to give us a somewhat better perspective on the course of wage earner 
incomes than can be gotten from a review of data for the years since 
1930. It is believed that the present series does give such a perspective, 
being of the right order of magnitude, moving in the right general direc¬ 
tion and being an improvement over existing estimates for this period. 


The money earnings of the average nonfarm employee rose by 251 
per cent over the half century, 1890-94—1940-44, while real earnings 
gained by 68 per cent over the same period. As Tables 1 and 2 empha¬ 
size, the gain in real earnings over the decades from McKinley to 
Coolidge was a most moderate one, the real rise beginning in the much 
mal^ned years of the now prosperity. The effect of the depression was 
so comprehensive, so catastrophic, however, that real oaminp in 1930- 
34 were almost identical with those in 1890-94, forty years before. 
Despite a vast and persistent gain in wage rates—^full time equivalent 
eam in p rose 136 per cent in the interval—the fact that almost a fifth 

lABLn 1 

EAIININGS OF NONFARM EMPIX)'i7X8 
isw-toie 


Full Time 

Equivalent harninics 


IS90-04 

$ 553 

1895 99 

542 

1900-04 

590 

1906-09 

641 

1010-14 

700 

1915-19 

978 

1920-24 

1,392 

1925-29 

1,492 

1930-34 

1,307 

1035-39 

1,366 

1940-44 

1,010 

1045-46 

2,508 


Money 

Xteal Earninys 

T]ainincn 

(1010-14 dollura) 

% 502 

S 636 

4f>8 

597 

554 

644 

697 

647 

649 

649 

918 

678 

1,235 

682 

1,384 

781 

960 

640 

1,103 

721 

1,763 

1,066 

2,408 

1,261 



TABLE 2 


MONEY AND HEAL EARNINGS OF NONFARM EMPLOYEES 
1890-1946 
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TABLE 2 (Continued) 



Full Time 
Equivalent 
Earning*) 

Per cent of Time 
Lost by 
Unemployment 

Money 

Earnings 

Cost of Living 
(1910-14a 
100.0) 

Real Earnings 
(1910-14 
dollars) 


1,470 

20.1 

1.176 

143.1 

821 

1941 

1,623 

13.6 

1,402 

150.2 

933 

1042 

1.806 

6.4 

1,776 

166.4 

1,067 

1943 

2,195 

2.6 

2,138 

176.5 

1,211 

1944 

2,365 

1.6 

2.327 

179.2 

1,299 

1945 

2,456 

2.6 

2,392 

1&3.4 

1,304 

1946 

2,560 

5.3 

2,424 

198.9 

1.219 


of our labor force was without work had completely nullified the gain 
in rates. Within the next ten years the advent first of a quaUfied pros¬ 
perity and then of a war production program had changed famine 
to feast; real earnings rose by 67.0 per cent as compared to the 0.6 per 
cent gain over the previous four decades. Money earnings doubled in 
the six years followir^ the outbreak of both World War I and II. But 
real earnings rose 69.0 per cent in the later period, thanks to effective 
price control, as compared to a more 15 per cent in the earlier period. 

The scrupulous reader will keep in mind the inadequacies of any cost 
of living index which, even at best, can only take inadequate account 
of quality changes. But though doing so he may choose to set off against 
such errors the higher costs of urban existence which a cost of living 
index will likewise tend to ignore. While the measurement of family 
income is a stage removed from our present concern it seems likely that 
the trend of that scries would have been amilar to that of the present 
scries.* And as the present series are not measures of family income, so 
they are not measures of labor costs. Information on manufacturing 
for the 1920-43 period, for example, very definitely emphasises the 
point that labor costs may decline sharply at the same time that wage 
rates and aimual earnings gain just as sharply. 

While it is not possible to establish the chief sources of the rise in 
earnings over the entire half century a very rough estimate can be 
made for the years 1919-23 to 1939-43 when money earnings rose some 
18 per cent. The over-all rise in full time equivalent earnings had been 
21.6 per cent, 4 points of which had been offset by increased unemploy¬ 
ment, leaving 17.6 per cent as the rise in actual money earnings. (It is 

* In 1901 tbe head of family provided 80 per cent of the family’s income: in 1935-36 the fieure was 
82 per cent. The 1901 estimate is based on data appearing in the Commissioner of labor's Rexmrt 
(1904)«pp. 362,366. The 1936-36 estimate is based on ratios to be derived from the various BLS Con- 
Purchases volumes, these being weighted by the distribution of nonfann familie s given in Conr 
tumor Incomes in^e CfnifedStoteSa National B«90uice8Coininittee« Table 24 B. 
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of interest that so sizable an offset for unemployment occurs: the earlier 
years included the entire postwar depression while the 1939-43 period 
included at least 3 years of substantial defense and war production.) 

Betuming to the 21.6 per cent rise in full time equivalent earnings, 
what can we say of the sources for that rise? An exceedingly rough ap¬ 
proximation suggests that upwards of three-quarters of the gain came 
about because of iucreased earnings on a given job, while the balance 
derived from changes in the compodtion of the labor force. Three 
elements are significant in this composition shift: industrial change, 
occupational change, and age change. 

The importance of the industrial change may be approximated by 
using a constant 1919-23 industrial pattern for both periods in place 
of the actual ones. Tables 3 and 4 present employment and earnings 
estimates for both 1919-23 and 1939-43.' By assuming a constant dis¬ 
tribution of weights between the manufacturing sub-groups and be¬ 
tween the major industrial groups we secure an estimated income gain 
of 1.6 points less than the actual gain. In other words 1.6 out of the 
21.6 points gain in full time earnings may be assumed to have derived 
from the change in industrial composition. 

But more than a shift in industrial composition was at work. At the 
end of the period the average worker was more highly skilled. What 
portion of the rise in earnings stemmed from this source? Table 5 pre¬ 
sents the data used under the assumption of a constant occupational 
composition of the employee work force. Occupation data are readily 
available and as here presented exclude both professional workers and 
retail trade proprietors, managers and officials, since so large a propor¬ 
tion of both groups is composed of self-employed persons.^ 

Earnings data are less readily come by, and even conceding a number 
of points we must assume that a comparison can usefully be made by 
taking 1920 and 1939 occupational earnings as the same. (It can be 
demonstrated that using such an assumption for the industrial data, 
where we do have earnings at both dates, makes little difference in the 
results.) We can fairly well estimate what proportion of the experienced 
workers in the labor force who were employed 12 months in 1939 were 
not wage and salary workers—Whence ones whose incomes would not be 

* The employment data are derived from Kuznets and The National Income Suptlemeni, nvith ad¬ 
justments for comparability as described below in sections, whmre a description of the derivation of earn¬ 
ings data likewise appears. For manufacturing, Uie 1939-43 income using constant sub-group weights 
was $1,693 as compared to the actual $1,764. 

* Alba Edwards, ComparoUive Occupation Statiatiee, 1870-1940, 1943, pp. 186-187. For 1940 the 
number of emplosred is used; for 1920, the number of gainfully occupied. It is assumed that the changing 
distribution of self-employed persons witibm the specified occupational groups would not materially 
affect the results. 
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representative of employees earnings in a given occupational group.® 
By assuming these workers to cluster at the lower end of the wage and 
salary income distribution wc can then derive median wage and salary 
income of full time employees in the various occupational groups.® 
Combining these medians first with a constant and then a varjring 
occupational distribution we arrive at practically identical figures for 
1920 and 1940. Taking account of the differences between the 1920 dis¬ 
tribution and the 1919-23 distribution, as well as those between 1940 
and 1939-43 we can conclude that there was probably a slight, though 
insubstantial, gain because of occupational change. 

A third factor to be considered as a source for the rise in earnings is 
the changing age distribution of the labor force. By comparing the 
urban workers age distribution of 1920 with that of 1940 and assuming 


TABLE 3 

EARNINGS AND EMl»LOYMENT BY MANUFACTURING INDUSTRY 
1919-23 and 1939-43 



Aveiage 

Full Time Equivalent 
Employees 

Aveiage 

Full Time Equivalent 

Incomes 

1019-23 

1939-43 

1919-23 

1939-43 

Food 

849 

1,299 


1,551 

Tobacco 

166 

102 


1,140 

Textile 

1,161 

1,306 


1,209 

Appaicl 

618 

1,003 


1,226 

Lumber 

679 

651 


1,113 

Furnituie 

163 

423 

1,263 

1,372 

Paper 

233 

360 

1,340 

1,689 

Printing 

442 

566 

1,009 

1,803 

Chemicals 

448 


1,667 

1,948 

Petr, cool 

111 


1,844 

2,227 

Rubber 

120 


1,523 

1,001 

Ijcather 

316 

385 

1,283 

1,284 

Stone 

32a 

399 

1,393 

1,620 

Iron 

06.3 

1,700 

1,591 

2,007 

Nontcrious 

320 

390 

1,404 

1,951 

Machinery 

3,614 

2,943 

1,567 

2,103 

Auto 

362 

513 

1,674 

2,359 


< From the 1940 Census of Population, Occupational CJtaraeterUtxeCt Table 13, we can compute the 
per cent of waso or salary workers in each occupational Kroup that worked 12 months. Applying these 
pcK^ontagcs to the numlicr of ware earners in the complete count (1040 Census of Population, The 
Labor Foret, Pait I, Table 72), deducting the result from the total of c.cperienccd persons in the labor 
ioxce 12 months, we have the number of non-wage or salary workers that worked 12 months in 1939. 

• Medians were computed by deducting the number of non-wage or salary workers who worked 12 
months from the total experienced labor force who worked 12 months, cumulating from the $0 income 
group. (Income distiibution from The Labor Force, loc. cii.) The cluatering premise is clearly inade* 
quate. Just os clearly its bias is not going to make much difference in the medians. 
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TABLE 4 

EAENINGS AND EMPLOYMENT BY MAJOR INDUSTRY GROUPS 
1919-23 aod 1939-43 



Average 

Full Time Equivalent 
Employees 

Average 

Full Time Equivalent 

Income 

1919-23 

1939-43 

1919-23 

1939-43 

Mining 

1.032 


31,696 

$1,658 

Construction 

1,245 


1,482 

1,787 

Trade 

4,074 


1,417 

1,531 

Finance, insurance 

996 

1,344 

1,809 

1,860 

Service except 





domestic 

2,137 

3,174 

948 

1,079 

Domestic 

1.785 

1,942 

598 

637 

Transportation 

2,655 

2,273 

1.536 

2,007 

Utilities 

570 

915 

1,251 

1,826 

Manufacturing 

9,078 

13,335 

1,377 

1,764 

Government 

2,901 

5,153 

1,210 

1,631 


the same difference between earnings at different ages existed in 1920 
as did in 1939 we find about a 3 per cent rise in earnings because of 
changes in age composition.This 3 per cent rise is based on a 5 per 
cent rise because of changing age composition, 2 points of which were 
lost because of the increased importance of women in the labor force, 
women generally receiving lower incomes than men. 


TABLE 5 


EMPLOYMENT (1920 AND 1040) AND EARNINGS OP FULL TIME 
EMPLOYEES (1939) BY OCCUPATION GROUP 




Male 



Female 



Employment 

(000) 

Median 
Full Time 
Eatnings of 

Employment 

(000) 

Median 
Full Time 
Earnings of 


1920 

1940 

Employees 

1939 

1020 

1940 

Employees 

1939 

Proprietors and man¬ 
agers exc. trade 

1,314 

1,664 

$2,600 

78 

193 

in 

Clerks 

3.491 

4,785 

1,500 

2,191 

3,248 


Skilled workers 

5,469 

5,089 

1,600 

102 

87 


Semi-ekilled workers 

4,371 

6,190 

1,260 

2,280 

3,072 


Laborers exc. farm 

5,819 

3 648 


200 

100 

760 

Servants 

675 

1,300 

1,000 

1,570 

2,346 

400 


10 age distributions are based on those for non-agricultural gainful workers other than those 
in trade as given in the 1920 Census of Population, Vol. IV, p. 876, and the 1940 Census of Population, 
VoL III, Pt. I, pp. 197, 199, Earnin gs by age and sex are from 1940 Census of Population, Wage and 
Salary Ineomea in 19S$, p. 107. 
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It is not possible simply to add the 1.6 point rise in earnings because 
of changing industiial composition, the 3.0 per cent rise because of 
changing ago composition and the gain, say of 0.5 per cent, because of 
varying occupational composition. This is so partly because the figures 
may not be exactly compared, partly because the changing industrial 
composition overlaps the age change, and so on. We must therefore be 
content with the understanding that of the 22 per cent gain in full time 
earnings something upwards of three-quarters arose because of a rise 
in earnings on a given job and a relatively small amount derived from a 
rising level of skill in the labor force or the rising level of maturity. 
(With the essential addendum that part of the rise which did take place 
was nullified by the increased role of women in the labor force.") 

II 

Comparable estimates of full time equivalent earnings by industry 
for the years 1919-1943 are given in Tables 6 and 7. In terms of full 
time earnings as, Chart I emphasizes, the various manufacturing indus¬ 
tries retained their general position with reference to one another over 
the period 1919-23 to 1939-43. Petroleum and coal products manu¬ 
facturing was the highest paying industry in both quinquennia while 
tobacco products was the lowest in one, next to the lowest in the other. 
The sharpest shift was that of printing, which fell from 2nd to 7th 
place, while nonferrous manufacturing rose from 7th to 4th. The 
similarity of structure is likewise evident in data for major industrial 
groups (Chart II), although it is nowhere near as neat." 

In general the per cent changes in income were related to the changes 
in employment as Chart III suggests for manufacturing industries and 
as tho freehand curve in Chart IV less reasonably suggests for major 
industrial groups. It is clear that there is a close relationship between 
changes of income and employment in manufacturing, a much rougher 


u Tnnsmuch as tho carninfcs data used for tho occupational and age calculation were from the 1940 
Census it might be noted that by also using median earnings data by industry for 1930 from the censoa 
(Wai;e and Salary Income in iOSO^ pp. 146-147) in place of the 1039-43 full time equivalent averages, 
one would got a rise one-half of one per cent greater. 

The reader may not unreasonably bo surprised by the small gains attributable to occupational 
and industrial shifts. Several factors might be noted. (1) These data apply only to nonfarm employees; 
the riding importance of service industries and the declining importance of agriculture is therefore irrele¬ 
vant. (2) A riaing proportion of women in the labor force has tended to limit the rise in the average wage. 
(3) Because in point of fact a rise ascribed to a change in industrial composition only came about ihroufi^ 
the operation of changes in age and occupations as well it will not do to emphasise one factor to the 
exclusion of the others. 

>> A more extended disoussioni apjpeara in the writer’s *Wage Struotures,” Bemew of Economic 
SUaisUm, November, 1947. 
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CHART I. 

EARNINGS AND EMPLOYMENT BY MANUFACTURING INDUSTRY, 
1919-23 AND 1939-43 



AVERAGE FULL TIME INCOMES, 1919-23 


CHART 2. 

EARNINGS AND EMPLOYMENT BY MAJOR INDUSTRY GROUPS, 
1919-23 AND 1939-43 



AVERAGE FULL TIME INCOMES, 1919-2? 



PERCENT CHANGE IN INCOME, 1919-23 TO 1939-43 


CHART 3. 

PERCENT CHANGE IN FULL TIME EQUIVALENT EMPLOYMENT AND INCOME, 
1919-23 AND 1939-43, BY MANUFACTURING INDUSTRY 



-20 0 eo 40 60 80 100 »0 

PERCENT CHANGE IN EMPLOYMENT. 1919-23 TO 1939-43 


CHART 4. 

PERCENT CHANGE IN FULL TIME EQUIVALENT EMPLOYMENT AND INCOME, 
1919-23 AND 1939-43, BY MAJOR INDUSTRY GROUPS 



Enatom: For TKADB io upper left quadrant (30 7,-14 l) read TBAKSPOSTATIOH'. 
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one for major industries." Whether this contrast arises from the fact 
that productivity changes in manufacturing industries tended to occur 


T.\BLE 6 


ATESAGS ANNUAL EABNINGS PER FULL TIME EMPLOTEE 
Major Industry Groups 


Year 


Contract 

OonstTuo- 

tion 

Manu- 

faeturiog 

Wholesale 

and 

RetaO 

Trade 

Finance, 
Insurance 
and Real 
Estate 

Transpor¬ 

tation 

Communi¬ 
cation and 
Public 
Utilities 

Domestic 

Service 

Service 

except 

Domestic 

WM\ 

nn 



(in 

dollars) 





1919 

Bf^ 

1,394 



1,578 

1,403 


510 

1,103 

1920 


1,709 

1,532 

B|B9 

1,757 

1,694 

BMfl 

578 

1,201 


BIfttyB 

1,386 

1,345 

■m 

1,865 

1,571 

1,300 

620 

1,201 


1,657 

1.305 

1,295 

1,419 

1.939 


1,290 

633 

■ 


1,831 

1,614 

1,410 

1,472 

1,904 

1,513 

1,313 

647 


m lyzm 

1,738 

1,621 

1,431 

1,462 

1,954 

1,529 


668 



1,620 

1,656 

1,454 

1,539 


1,553 


681 


1926 

1,655 

1,664 

1,478 

1,583 


1,569 


684 


1927 

1,630 

1,707 

1.503 

1,522 

2,033 

1,579 

1,443 

690 

1,324 

1928 

1,570 

1,717 

1,535 

1,546 

2,059 

1,609 

1.474 

688 

1,313 

1929 

1,526 

1,674 

1,543 

1,597 

2,090 

1,642 

1,474 

701 

1,371 

1930 

1,424 

1,526 

1,488 

1,568 

2,001 

1,610 

1,497 

650 

1,373 

1931 

1,221 

1,233 

1,369 

1,497 

1,886 

1,549 

1,514 

561 

1,326 

1932 

1,016 

907 

1,150 

1,318 

1,687 

1,373 

1,438 

477 


1933 

990 

869 

1,086 

1,187 

1,591 

1,334 

1,351 

442 

1,119 

1934 

1,108 

942 

1,153 

1,232 

1,635 

1 393 

1,426 

455 

1,129 

1935 

1,154 

1,027 

1,216 

1,281 

1,668 

1,492 

1,486 

467 


1936 

1,263 

1,178 

1,287 

1,299 

1,747 

1,582 

1,522 

487 

1,181 

1937 

1,366 

1,278 

1,376 

1,356 

1,819 

1,644 

1.601 

536 

1.215 

1938 

1,282 

1,193 

1,296 

1,357 

1,762 

1,676 

1,674 

506 

1,220 

1939 

1,367 

1,268 

1,363 

1,365 

1,761 

1,723 

1,692 

520 

1,235 

1940 

1,388 

1,330 

1,432 

1,391 

1,754 

1,754 

1,718 

533 

1,240 

1941 

1,579 

1,638 

1,653 

1,491 

1,805 


1,766 

578 

1,291 

1942 

1,795 

2,194 

2,023 

1,626 

1,918 

E'SI 

1,881 

678 

1,400 

1943 

2,160 


2,350 

1,804 

2,071 



876 


1944 

2,499 

2,602 

2,517 

1,965 

2,203 

2,677 

2,248 

1,080 

B~irB 

1945 

2,618 

2,612 

2,525 


2,365 

2,732 

2,416 

1,230 


1946 

2,677 

2,581 

2,512 


2,567 

2,937 

2,560 

1,328 



in unison and proportion unlike those in nonmanufacturing, or whether 
it arises from still other factors cannot be sufficiently dealt with in the 
compass of the present article. 


u The relataondiip between income and employment changes in manufacturing, and these changes 
in lotion to productivity gains, has been ably discussed in Solomon Fabricant's Employment in 
Man^f(u^tming, 189&^19S9,1942, especially pp. 100-113. 
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III 

To establish the basic series for real earnings of nonfarm employees 
three main constituent series are required: one for full time equivalent 
earnings, a second for per cent of time lost by unemployment and a 
final series for changes in the cost of living. The derivation of each of 
the required scries will be discussed in turn. 


FuH time equivalent earnings. Series for the 1929-1940 period ate 
available from the Denison estimates for the Department of Com- 
merce.“ For 1919-28 estimates have been made by Kuznets, but these 
estimates are not comparable with the Denison figures.^* The series 
were examined for the overlap period to determine the best method of 
linking. For all groups the similarity of movement was obvious, but 
the levels differed in most instances. After a number of necessary re¬ 
groupings of industries had been madc,“ the various scries were then 
linked on the basis of the regression relationship for the period 1929- 
38.” In a number of instances the computed Commerce-level estimates 
were then ratio linked to the Commerce estimates for 1929 to prevent 
a reversal or exaggeration of the 1928-30 movement. 

This 1919-43 series was then extrapolated to 1910 by the relationship 
between this series and that of W. I. Fang, in the overlap period.”-” 
This series in turn was further extrapolated to 1890 by the 1910-26 


u Survey of Current Suaineee, National Income Supplementf July 1947. 
u Simon Kuznets, National JMome and Jte Compot,itiont WIO-IOSS (1941). 
u The following ndjustmentH in Kuznets' senes were made: hosiety and knit goods added to testile 
mill, deducted from apparel; leather footwear deducted from apparel, added to leather; rubber footwear 
deducted from apparel, added to rubber; heating apparatus added to iron and steel; restaurants added 
to trade, deducted from service. 

vt Lumber and timber basic products; printing and publishing; iron and steel were considered to be 
directly comparable. 

The abHolute dilFeronces between the Kuznets and Denison series are for the most part small enough, 
the similarity of movement great enough, that it is oven possible to splice the series by a simple ratio 
link. Except for these three groups, however, rogiossions were used. 

IB W. 1. King, The National Income and Purehai>ing }*owcrt 1030, pp. 56, 60, 122. Bowden has re¬ 
vised some of the basic data which King relied on, but the use of these substitute figures makes little 
difference in the final estimates here presented. Cf. Witt Bowden, War and Po><-Wor Wopei, Prieee 
and //ours, 1014 - 2 S and 10S9-44, BLS Bulletin 852. Additional data of Bowden's, for earlier years, 
appear in the MontJdy Labor Renew, September, 1940. 

King's figures have been adjusted by Kuznets witb a view to comparability (Kuznets, op, cU., 
Vol. II, pp. 469-474) but those adjustments do not affect our present concern. While King's implicit 
average oompensation figure for nonfarm employees is $1,220 in 1919 and $1,479 in 1920, the estimates 
here presented are $1,260 and $1,448 respectively. King's figures were therefore assumed to be directly 
comparable with the present estimates. The 1919-43 series oould, of course, have been extrapolated all 
the way back to 1890 by Douglas. But instead of such a procedure, based on the 8-year overlap from 
1919-26, it seemed preferable to use King's figures back to 1910, thus producing an independent series 
which oould be related to the Douglas series for the 17-year period, 1910-26. 



TABLE 7 (FABT 1) 

AVERAGE ANNUAL EARNINGS PER FULL-TIME EMPLOYEE 
Manufacturing Industries 
1919-1946 


Year 

Food 

and 

Eiodrod 

Products 

Tobacco 

Manu¬ 

factures 

Textile 

MiU 

Products 

Appard 

and 

Other 

Finished 

Fabric 

Products 

Lumber 

and 

Timber 

Baaio 

Products 

Furniture 

and 

Finished 

Lumber 

Products 

Paper 

and 

Allied 

Products 

Printing, 

Pubhsh- 

and 

AlUed 

Industries 

Chemicals 

and 

AlUed 

Products 

■I 

1,319 

942 

1,013 

1,237 

fin dollars) 
1,087 

1,151 

1,236 

1,320 

1,467 



1,083 

1,226 

1,405 

1,376 


1,461 

1,666 



1,448 

950 

1,052 

1,284 

1,013 

1,282 

1,320 


1,531 


1,386 

950 

1,001 

1.275 


1,258 



1,499 

1923 

1,437 

988 

1,110 

■ 9 


1,317 

1,375 

1,737 

1,556 

1924 

1.468 

981 

1,099 

■ 9 

1,107 

1,345 


1,812 

1,581 

1925 

1,460 

1,001 

1,114 


1,121 

1,359 

1,421 

1,868 

1,618 

1926 

1,484 

1,006 

1,119 

1,331 

1,135 

1,385 

1,452 

1,964 

1,627 

1927 


976 

1,151 

1,364 

1,155 

1,386 

1,470 

1,969 

1,644 

1928 

1,516 

965 

1,141 

1,362 

1,164 

1,379 

1,501 


1,686 

1929 


979 

1,156 

1,361 

1,172 

1,398 

1,514 

2,010 

1,673 

1930 

1,489 

985 

1,096 

1,265 

1,156 

1,310 

1,487 


1,647 

1931 

1,451 

908 

1,039 

1,162 

1,010 

1,196 


1,943 

1,608 

1932 

1,303 

787 

847 

941 

787 

962 


1,740 

1,410 

1933 

1,204 

725 

829 

900 

737 

900 

1,143 

1,599 

1,312 

1934 

1,221 

750 

883 

987 

791 

048 

1,186 

1,044 

1,341 

1935 

1,253 

778 

926 

1,016 

833 

988 

1,235 

1,698 


1936 

1,290 

817 

952 

1,013 

911 

1,074 

1,313 

1,702 

19 

1937 

1,351 

883 

994 

1,025 

963 

1,123 

1,403 

1,722 

! i 

1938 

1,331 

870 

926 

999 

940 

1,102 

1,359 

1,697 

1,621 

1939 

1,372 

916 

960 

1,025 

956 

1,138 

1,414 

1,718 

1,611 

1940 

1,385 

1,000 

986 

1,022 

934 

1,158 

1,458 

1,764 

1,723 

1941 

1,472 

1,117 1 

1,159 

1,159 

1,026 

1,304 

1,646 

1,852 

1,893 

1942 

1,650 

1,240 

1,385 

1,330 

1,204 

1,514 

1,850 

1,973 

2,139 

1943 

1,879 

1,431 

1,556 

1,595 

1,449 

1,743 

2,076 

2,158 

2,386 

1944 

2,044 

1,580 

1,681 

1,788 

1,564 

1,892 

2,254 

2,376 


1945 

2,176 

1,693 

1,814 

1,944 

1,617 

1,983 

2,363 

2,677 

2,683 

1946 

2,392 

1,798 

2,037 

2,168 

1,781 

2,203 

2,547 

2,871 

2,752 


relationship between it and the Douglas series.^** 

Time lost in vnemployment. Given a series for full time earnings our 
next requifflte is a parallel one for time lost in uncmplojuncnt by non- 
farm employees so that tbe earnings series can be transformed from a 
full time equivalent one into an actual average series. Total unemploy¬ 
ment estimates for the 1929-40 period have been made available re¬ 
cently by the Bureau of Labor Statistics.** Comparable but unpub- 


M Douglas, Real Wages, Table 147, p. 392, average earnings in all industries exduding farm labor. 
Dou£^ notes that his series represents the movement of earnings of 73 per cent of those who worked 
for wages or salaries in 1920. (Op. ctf., p. 389.) 'Wl^out attempting to revise his estimates on the basis 
of later data it is nonetheless clear that his series includes the b ulk of nonfarm workers hence can 
be readily used for extrapolating a nonfarm earnings series. 

» Cf. The writer’s (Bureau of Labor Statistics, Occupational Outlook Division, July 1945), Prelim^ 
iasary Estimates of Labor Force, Bmptoymentani Unomptoyment, 1S29-1940, 









































TABI.B 7 (PAET S) 


Ptoducts 

of 

Petrolpum 

and 

Coal 

Rubber 

Products 

Leather 

and 

Leather 

Products 

Stone, 

Clay, 

and 

Glass 

Products 

Iron and 
Steel and 
Their 
FioducU 
(including 
Ordnance) 

Non- 

ferrous 

Metals 

and 

Their 

Products 

Machinery 

and 

Transpor¬ 

tation 

Equip¬ 

ment 

(cxeept 

Auto¬ 

mobiles) 

Auto¬ 

mobiles 

and 

Auto¬ 

bile 

Equip¬ 

ment 

Total 

All 

Manu¬ 

facturing 

Year 


1,446 


■1 

(in dollar s) 
1,599 

1,360 

1,403 

1,531 

1,304 

1919 


1,759 



1,882 

1,596 

1,700 

1,818 

1,532 

1920 


1,410 




1,414 

1,643 

1,668 

1,345 

1921 

1.725 

1,442 



1,377 

1.434 

1,446 

1,617 

1,295 

1922 

l,7Ql 

1,530 

1,274 

1,460 

1,622 

1,516 


1,735 

1,410 

1923 

1.782 

1.501 

1,264 

1,536 

1,633 

1,580 

1.629 

1,720 

1,431 

1924 

l.SOt 

1,542 

1,278 

1,610 

1,652 

1,588 

1,658 

1,795 

1,454 

1925 

1,821 

1,673 

1,287 

1,519 

1,667 

1,639 

1,678 

1,786 

1,478 

1926 

1,832 


1,303 

1,546 


1,636 

1,717 



1927 

1,858 

1,633 

1,206 

1,568 

■rS 

1,675 

1,754 

1,864 

1,535 

1928 

1,844 

1,507 

1,327 

1.557 


1,665 

1,756 

1,813 

1,543 

1929 

1.004 

1,563 

1,216 

1,525 

1,640 


1,715 

1,571 

1,488 

■MM 


1,302 

1,152 

1,386 

1,410 

B^ 

1,507 

1,455 

1,369 

1931 

1,610 

1,101 

970 

1,167 

1,044 

1,177 

1,283 

1,234 

1,150 

1932 


1,137 

050 

1,071 

1,073 

1,132 

1,245 

1,170 

1,086 

1933 

1,513 

1,248 

1,017 

1,088 

1,166 

1,209 

1,320 

1,314 

1,153 

1934 

1,587 

1,358 

1,043 

1,171 

1,295 

1,277 

1,400 

1,489 

1,216 

1935 

1,G20 

1,472 

1,045 


1,446 

1,361 

1,520 


1,287 

1936 

1.833 

1,526 

1,085 

1,357 


1,402 

1,658 

1,672 

1,376 

1937 

1,863 

1,457 

1,017 

1,303 


1,402 

1,538 

1,653 

1,296 

1938 

1,852 

1,548 

1,038 

1,359 

1,549 

1,521 

1,653 

1,762 

1,363 

1939 

1,051 

1,583 

1,041 

1,303 

1,643 

1.504 

1,767 

1,934 

1,432 

1940 

2,113 

1,778 

1,236 

1,554 

1,923 

1,824 

2,091 

2,243 

1,653 

1941 

2,410 

2,116 

1,447 

1,771 

2,284 

2,235 

2,592 

2,880 

2,023 

1942 

2,806 

2,478 

1,659 

2,024 

2,637 

2,581 

2,862 

2.078 

2.360 

1943 

3,046 

2,609 

1,831 

2,174 

2,781 

2,724 

3,022 

3,103 

2,517 

1944 

3,093 

2,715 

1,069 

2,252 

2,806 

2,741 

2,004 

2,084 

2,525 

1945 

3,180 

3,826 

2,123 

2,394 

2,687 

2,710 

2,808 

2,796 

2,512 

1946 


lished figxircs for 1920-29 arc liketnse available.^ These data coDsti< 
lute a logical starting point. 

Before this scries can be used as a measure of time lost by nonfarm 
cmplo 3 recs, however, we must consider to what extent the unemploy¬ 
ment of farm hands and self-employed persons is included in that 
scries. If it can be established that unemployment in those groups can 
be balanced off against the failure of the series to allow for the full 
measure of time lost by nonfarm employees, then the series can be 
used as is. 


^ DeRoribed by tho writer in a forthcoming BLS teohnioal memorandum. 





































88 


AMEBICAMT STATISTICAL ASSOCIATION 


On it may be assumed that the nonfarm self-employed con¬ 

tributed nothing to the unemplosonent totals. The small number of 
nonfarm self-employed in April 1940, for example—^roughly 0.2 out of 
4.4 million experienced workers seeking work—was more than com¬ 
pensated for by the number of disemployed workers who, turning to 
self-emplo 3 unent, had disguised their unemployment in that form.” 

Agricultural workers—operators and hired hands—constituted about 
4 per cent of total unemployment in April 1930 and about 8 per cent 
of the experienced unemployed in March 1940.” More than compen¬ 
sating for the inclusion of agricultural unemplo 3 mient is the exclusion 
of two groups of unemployed. First is the group of women and older 
workers who, after being without work for some time, withdraw from 
the labor force.” Second is the group of workers who migrated to rural 
areas because of lack of work in nonfarm pursuits, the relationship be¬ 
tween farm-nonfarm migration and the level of employment being 
well known. It may be added that even if we were to ignore these two 
groups and allow in full for agricultural unemployment, the resultant 
estimate for 1940 would be 18.3 per cent of time lost rather than 19.9. 

Granted these various considerations, therefore, it was decided to 
take the level of unemplo 3 nnent shown by the BLS estimates as equiva¬ 
lent to the level of unemplosonent among nonfarm employees.” The per 
cent of time lost is computed as the ratio of unemplosrment on the one 
hand to employed nonfarm employees plus unemployed on the other. 

Given unemployment estimates for 1920-43 we must next derive 
serviceable estimates for prior years. National unemployment enumera¬ 
tions were conducted and data published in connection with both the 

” 1940 Census of Pbpulation, Employment and Peraonid Charariarvttica, Table 11, with a deduction 
for unemployed farm operators (1940 Census of Population, Occupational Characteristics, Table 6). 
A discussion of the relatively small decline in self-employment appears in the BLi3, EsHmaiee cf the 
Nonrogricultural Self-Employed, 1929-1940 (1945). 

** 1930 Census of Population, Unemjdoyment, Vol. I, pp. 53-54, Class A and B unemplosonent. 1040 
Census of Population, The Labor Force, Vol. Ill, Table 59; data in Table 1 for rural farm areas allow 
ns to take account of new workers and public emergency workers but necessarily include some nonfarm 
workers. The rural farm ratio is 11 per cent. It should be noted that some 20 per cent of the experienced 
persons seeking work in March 1940 who reported their last occupation as farmer or farm ojserator did 
not report their usual occupation in the same group; this contrasts with the oveivall male average of 
10 per cent. (1940 Censusof Population, Usual Occupations, Tables 11 and II.) There is likely to be some 
inflation in agricultural unemployment, therefore, precisely because workers have exported their un¬ 
employment from cities to rural areas, attempting to become farm workers in the process. On the other 
hand, some unemployed farm workers imdoubtedly sought work in the cities. 

“ Gladys Palmer has called attention to “the substantial increase we found in Philadelphia in the 
number of households reporting no employable member as the depression continued.* (With respect to 
the 1929-40 period, the poinc is discussed at greater length in a forthcoming BLS Bulletin.) 

^ Given a continuing substratum of disguised unemployment these percentages are probably some¬ 
what too small. More important for present purposes is the fact that the undoubted gain in the impor¬ 
tance of such unemployment during the depression is not reflected in the estimates of time lost. As a 
xesolt the gain in nonfarm. aMminga over the years is «»»« 4 y giftrated . 
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1890 and the 1900 Census of Population. However it does not now ap¬ 
pear possible to make any use of the 1890 estimates.^^ An estimate for 
1900, however, can be computed from the Census data after certain 
adjustments are niade.^* 

(1) The first such adjustment is the exclusion from both the gainful 
worker and the unemployment totals of those workers who do not come 
within the purview of the post-1940 “labor force* concept. For male 
workers it is possible to make such exclusions with at least approximate 
accuracy, utilizing for the purpose the 1901 Cost of living survey.** 

(2) Since the 1900 Census distributes the unemployed into three 
unemployment duration groups it is then necessary to estimate the 
average duration for each group before total time lost in unemployment 
by males can be estimated. As the 1940 Census data indicate it is not 
possible to secure an accurate average for a group as large as 7 to 12 
months of unemployment merely by taking the mid-point. However, 
the 1901 Cost of Living survey gives a distribution into 23 duration 
groups and it is thus possible to calculate an average duration of un¬ 
employment for men in the nonagricultural labor force in 1899-1900.*® 

(3) In step one the appropriate percentage of total male unem¬ 
ployed nonagricultural gainful workers in 1899-1900 was deducted in 
order to ensure comparability with later data, using proportions from 
the 1901 Cost of Living survey. In step two medians were computed 
for each of the three unemployment groups after they had been thus 
adjusted. In step three a simple multiplication gave the estimated 
total weeks lost in 1899-1900 by the male nonagricultural labor force. 
After corresponding adjustments in the gainful worker figiu^s to give 
total labor force weeks for this group it was possible to compute the 
per cent of time lost by it—4.93.** 

Tho 1800 data, unlike those for 1000 and later years, apply chiefly to unemployment at the 
workoi'H primaly job ruth<*r than to total unemployment. Furthermore there is clear evidence that the 
1000 Census enumerators oiiciaicd with a hotter understandinR of what they were seeking, and hence 
obtained estimates which weie generally higher than those of 1800 merely because the enumeration tech¬ 
nique differed. 1800 Coukus of Population, Fart IX, p. oxzvi. 1000 Census of Population, OceupatioM, 
pp. coxxvi--ccxx.tiii. 

1000 Census, op. c»t„ pp. 78-77, 

Kightecuth Annual Aeport of tho Commissioner of Labor, 1003 (1004), pp. 200-201. It was as¬ 
sumed that those workers whoso unemployment was ascribed to the following causes would have been 
excluded by 1040 definitions: old age, sickness, strike or vacation. On the other hand those for whom 
any of these causes was reported in combination with another cause were included. This was done be¬ 
cause enunierative procedures in current use undoubtedly include some such workers in practice, and 
definitely include many who are temporarily ill and without a job. 

BO Durations, of course, were computed for the groups of workers remaining after the excluded 
workers had been deducted. Data on those unemployed for 52 weeks were taken from page 26 of the 
Report, while all other data appear on pages 290-291. 

Both teachers and manufacturers were excluded from these computations since according to 
ourrent practice few if any would be reported as unemployed. 
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(4) This percentage is very substantially below an estimate of 9.4 
per cent to be derived from the Cost of Living Reports A major reason 
for the divergence is the fact that according to the Report 50 per cent 
of rnftlpi urban heads of families had some imcmployment during a 
year endin g in 1901 while the Census figure for males except in agricul¬ 
ture was 23 per cent.® 

It seems reasonable to believe that in this respect the Cost of Living 
report is the more accurate measure. In such surveys, where income was 
reconciled with expense on a detailed basis of reporting, there was less 
likelihood that short period unemployment would be forgotten than 
there was in the general purpose population enumeration, which could 
necessarily have little check on the solidity of the unemployment im¬ 
ports in the absence of questions on income or related factors. It is 
therefore assumed that the Census figure is an imderestimate. An up¬ 
ward adjustment was made by doubling the number in the lowest 
duration group, exclusive of those in agriculture, teachers and manu¬ 
facturers. This step raised the 1899-1900 percentage to 8.3—or some¬ 
what below the adjusted 1901 figure of 9.4. 

This divergence was ignored because the unemployment figures re¬ 
ported to the Census for nonfarm women are likely to be too high.® 
Combining the estimate of 8.3 per cent with the parallel figure for fe¬ 
male workers an over-all percentage of 7.5 was finally secured.® This 
percentage then serves as a benchmark figure for 1900, comparable 
with the BLS 1920-40 estimates. 

If it were possible to utilize the well known estimates of Douglas 
for the intervening years then a complete imemplo 3 rment series could 

>2 The percentage to be derived directly from the Iteport is 6.8. However, this must be adjusted to 
allow for self-employed and for eiugle persons—both groups being included in the Census data. In 1040 
there were 7 male nonagricoltural employees to 1 employer or own-account worker. (1640 Population 
Census, HI, p. 1.) Though trade, service and construction together made up a much smaller fraction 
of the nonfarm total in 1040—as Carson’s data demonstrate—^we can nonetheless use the 1940 propor¬ 
tion and thus intentional^^ underestimate the spread between the Report and the Census estimates. 
Assuming a 0 rate of unemployment for the self-employed we find the adjusted 1901 rate is 6.0. However 
by assuming that the unemployment rate for urban single workers in 1900 was the same proportion of 
the rate for urban married workers in 1900 as it was in 1940 (1940 Census of Population, III, p. 22) 
the head of family rate as adjusted from the 1901 Report was used to construct a single workers rate. 
The two were then weighted by the distribution of nonagiicultural males by marital status (1900 
Census, Oceupofums, p. 52) with the result that the 6.0 figure rises to 9.4 per cent. 

" Report, p. 287. Occupations, pp. 7, 76. 

** A comparison of unemployment rates by occupations shows that female rates for various occupa¬ 
tions were above male rates in 1900, instead of almost uniformly below them as in 1930 and 1940. Since 
women are not as continuously in the labor force as men are, what is reported as ’’unemployment” 
in some instances is really ’’non-employment,” or time not in the labor force. (This factor was adjusted 
for in the male data on the basis of the 1901 Report.) 

w Data for agricultural unemployment are excluded since the chief reasons which dictated their in¬ 
clusion in later years do not apply to the 1900 data. Their inclusion would onbif reduce the 1900 aarwing a 
estimate by S21. 
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be calculated. Douglas's absolute estunates, however, cannot be used 
as is. If one follows his procedure of comparing the labor supply 
(estimated from Census of Population data) with emp]o 3 ntnent (es¬ 
timated from payroll sources) but uses the revised labor supply data 
of Daniel Carson and the revised employment estimates of the BLS 
one arrives at sharply different results for a test year.” While the ab¬ 
solute figures in Table 8 may not be compared the percentages can be. 


TABLE 8 

X7NBMPLOYMEMT IN SELECTED INDUSTRIES IN 1020 AS ESTIMATED 
FROM DOUGLAS'S, GIVENS’ AND REVISED DATA 


Me lufacturing and transporta¬ 
tion 

Labor Supply 

Employment 

Per cent 
Unemployed 

Carson—^BLS 

14,401 

14,402 

-0.6 

Douglas—Unadj.* 

10,732 

10,628 

1.0 

Givenst 

15,418 

14,761 

4.3 

Manufacturing, transportation. 




construction 




Present estimate 

16,621 

16,348 

1.1 

Douglabt 

12,221 

11,478 

6.1 


* Manufactunnff plus steam raikoads plus street railways. Data prior to arbitrary adjustment to 
Givens' unemployment percentase 

t Includes band trades in labor supply and posribly in employment as well. 

t After adjustment of manufacturing and transportation data to Givens' unemployment per¬ 
centage. 

The percentage of manufacturing and transportation combined is 
negative and by any reasonable interpretation is valueless. When 
construction is added the percentage becomes a poritive one, but its 
absolute value may still be in question. However the series may be 
reasonably indicative of the changing movement of unemployment. 
Given benchmark data for 1900 and 1920 the Douglas data may be 
used for interpolation. A regression of the present series for per cent 

» Douglas' daU from Real Wages, Chapter 24. The Carson estimates are from his IndwirieA 
Composition 0 / Manpower in the United Statea, J870-1940 (November 1046). 

The labor supply in construction was not estimated from the Carson data since he computes the 
number of construction laborers in 1920 by applying the 1930 ratio of laborers to number of workers in 
selected construction trades. However an unduly high proportion of construction workers in 1920 re¬ 
ported themselves in the skilled trades, presumably calling themselves carpenters, painters, etc., as a 
result of work in war construction. The labor supply was therefore estimated by applying to the BLS 
employment estimate the unemployment percentage used by Douglas (p. 451). 

The Givens' esUmates are from Recent Eeanomie Changes (1929). Vol. II. pp. 466-478. For most 
of his 1920 unemployment percentages Douglas uses the Givens' estimates. 

NICE estimates are likewise available for this period, and indeed back to 1900. Eeonemie Record, 
March 20,1040. These data for the 1900-1920 period, however, seem of questionable value. The 1920 
unemplosmient percentage, for example, is the resultant of a eombination of percentages for each in¬ 
dustry—the percentages secured by interpolating between a 1900 and a 1930 percentage, according to a 
communication from the NICE. The 1920 figure, however, is likely to have been less than either, and 
thin distortion serves to distort the entire 1900-30 level of the estimates. 
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of lost 1920-26 against that of Douglas’ series (for manufacturing, 

mining, construction and rail transport) was therefore computed.®^ 
This indicated a sufficiently close relationship so that the present series 
could be carried back to 1890. The resultant estimated series was then 
used to interpolate between the 1900 benchmark and the 1920 es¬ 
timate.*® The conmderation of such related mateiisds as the trend of 
deflated gross product, imemployment among union labor in New York 
and the qualitative data in Thorp’s Business Annals tended to confirm 
the verisimilitude of the movement thus indicated.®* 

The per cent of time lost for 1940-46 can be readily computed from 
the Bureau of Census data, thus providing a full series for the period 
1890-1946.®" By appl 3 ring this series to that previously computed for 
full time earnings we secure the final estimates for the actual earnings 
of nonfarm employees. The nominal earnings series together with one 
adjusted for changes in the cost of living appears in Table 2.®®>®® 


It may not be without interest to compare the movement shown 
in Table 2 with Douglas’ figures for the period when the two full time 
earnings series are independent: 1910 to 1926. Thus Douglas’ full time 
earnings figure for nonfarm wage earners rises 126 per cent from $652 


” Donglaa, op. cit., p. 460, presents estimates for the 4 specified industries for 1897-1926. His series 
was extrapolated to 1890 by its relationship, over the 1897-1926 period, to his series for unemployment 
in manufacturing and transportation (ibid , p. 445}. 

M The 1900 value estimated from the regression was 7.9 per cent, as compared to 7.5 from the Cen¬ 
sus materials. 

** The relationship between unemployment iiercentages and per cent deviations of gross national 
product from trend is dissussed in the forthcoming BIS memorandum on the 1920-29 estimates. As 
indicated there the relationship is not such that one can assume the relationship between the two senes 
during the 1920 ’b had the same slope as that in earlier years. Hence it could not be assuredly used for 
extnpolation. However the fact that the slope was like the 1920's, and not the 1930*8, did tend to confirm 
a relationship otherwise arrived at. 

For the New York data, cf. BLS Bulletin No. 109, Statutiet of Unemploytnent and the Work of 
Bmploymont Offices, 1913, p. 18. 

Bureau of the Census, Labor Force, Employment and Unemployment in the United States, 1940^ 
1040 (1947). 

The cost of living index is the regular BLS series extrapolated by a regression on Douglas' series 
Cop. eit., p. 60). It would have been possible to make some allowance for the BAE series on prices paid by 
farmers for food, prices paid for house furnishings etc.—giving these some weight as representative of 
prices paid by rural nonfann workers. However, the aggregate weight of this group is relatively small 
and the differences in movement between BAE and BLS price series relatively restricted. Therefore the 
BLS series was used as is. 

** The actual earnings series developed here was infiated by the ratio of total compensation to total 
wages and salaries for the 1920-d3 period. The relevant 1920-29 data appear in Kusnets, National 
Income and Its ComposUion, Vol. 1, p. 216. The later figures are from the Barmy of Current Business, 
July 1947. For the earlier years no adjustment is necessary. King's figures for total compensation 
having been used initially. 
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in 1910-14 to $1,473 in 1926.^® The present estimate, is necessarily 
higher in 1910-14, being for wage earners plus salaried workers, but its 
rise to $1,458 in 1926 shows an increase of 108 per cent. In part the 
difference in movement comes about because Douglas' scries, excluding 
as it docs construction, trade, finance and other groups, is not adapted 
to allow for the effect of changes in the industrial, occupational and sex 
composition of the labor force which result when these groups are 
included. But discussion of the difference should not be allowed to 
obscure the fact that the present estimate of change over this period is 
within 15 per cent of Douglas'— a. variance easily explained by the 
factors noted. 

The change in unemployment percentages over the 1910-1926 pe¬ 
riod is, of course, almost identical in magnitude, the movement of 
the present series being based on Douglas' for most of these years. 
(Douglas'—^for manufacturing, mining, transport and construction— 
falls by 22 per cent, while the present one for all nonfarm declines by 
21 per cent.) It is hardly surprising, therefore, that the net change in 
real earnings which Douglas shows for those 4 industries is very similar 
to that shovrn for all employees in the present estimates, 133 per cent 
in one instance, 112 per cent in the other 


« Doui;las, op nt , Table 147, p. 300. 

M Ibid., pp. 460, 468. With further reference to the similarity, in a general sense, it should be noted 
that materials on full time earnings and employment drawn upon for use in this present study deiive in 
no small port from the prodigious and invaluable laboxs of Douglas. 

The absolute levels of unemployment percentages are, of course, not comparable: Douglas' 
apply to only part of all wage earners, albeit the major part. As he bos emphasised there is no separate 
labor supply for manufacturing since thcie is an iniexchange between its wotkeis and those in construc¬ 
tion, tiansport, etc. (ibid, p. 002). It follows, theiefore, that unemployment percentages ^ould be 
computed over as wide a base as possible—^preferably all nonfarm workers, as is done in the present 
study. 



DIRECT DETERMINATION OF COMPASS SETTINGS 
FOR PROPORTIONATE AREA PIE-CHARTS* 


Hebman Lasbjsn 
Inglewood^ Calif. 


This note presents a chart which can be used by draftsmen 
to determine directly from raw area data compass settings for 
plotting proportionate area circles without any of the usual 
square-root and proportioning computations. 


AVOIDING here any discussion of the validity, from a purely statis- 
XX ^i<5al standpoint, of proportionate area pie-charts as a means of 
presenting statistical data, the fact remains that such charts are fre¬ 
quently used and that their preparation involves some computations. 
Regar^ess of the order in which the computations are made, the de¬ 
termination of the compass settings which will achieve proportionate 
areas of circles requires extraction of square roots and division of all 
such square roots by the square root of the circle taken as the standard 
of measurement for the particular chart. 

The accompanying chart is designed to eliminate all such computa- 
ions in the drafting of pie-charts. Following the order of the instruc¬ 
tions which are included on the chart: 

Step 1 reduces the given area data to two digits as a maximum, 
which is more than sufficient accuracy for this type of chart. 

Step 2 amounts simply to a square-root computation. The heavy 
curved line is the curve of square roots corresponding to the area scale, 
with the vertical scale set at values of zero to ten. (The scale is omitted 
from the chart as it is unnecessary for the purpose of the chart and 
simply adds confusion.) 

Steps 3 and 4 determine the compass setting for the largest circle 
on the chart and automatically select the oblique lettered line to be 
used for all remaining compass settings. 

Step 5 instructs the draftsman to draw his biggest circle, with his 
compass as set in step 4. 

Step 6 provides for similar settings and plottings for the remaining 
areas. All settings are to be made from the zero line to the oblique line 
determined as above. 

The original chart is a ten-inch square, principally for convenience 
of use. However, its use is not affected by changes in size. 


*T1i.tam ^^ebsrt’uundintUfiiot., ratentooinl.disrt.,bothwhdeaadMginaited. 
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STANDARD CHART FOR DIRECT DETERMINATION OF COMPASS SETTINGS FOR 
PROPORTIONATE AREA PIE-CHARTS 
iisnracTKws fw ose 

I. rOIKT OFF IIVCN nn oata to two fuces foi the uiiest taloc* 

0. OETCRNIHE point on CONVE CONNCSPONOINO VO LAR0E3T AREA. 

S. SET ORE POINT OF COMPASS ON 2ER0 LIRE RORIZOHTAI TO POINT SO OETERNIREO. 

«. SET SECONO COMPASS POINT ALONO SAME NORIZONTAL ON LETTERER LIRE NEAREST RAOIOS OF 
LAROEST CIRCLE DESIRED IN TIEN OF OENENAL SIZE ARD UYOOT OF CHART. 

S. NOTIRO LETTERED LINE SO DETERMINED, PLOT CIRCLE ON CNART WITH COMPASS AS SET. 

0 . ALL OTNER SETTINOS ARC THEN MADE FROM TRE ZERO LINE TO THE SAME ICTTCRED LIRE ALONO 
HORIZONTALS DETERMINED FROM THE POINTS ON TRE CNOTE CORRESPONOIHA TO THE OITEN AREAS. 


AREA 

O 10 20 30 40 SO 60 70 80 90 100 



O 10 20 30 40 50 60 70 80 90 lOO 


AREA 

Experience with the chart shows that most draftsmen require a single 
demonstration of its use. Once familiar with it, they use it with facil¬ 
ity and eliminate both the usual computations and the possibility of 
error inherent in such computations. 








PROFILE GRAPHS 


John V. Spihlmans 
Marquette University 

This article is concerned with certain graphical devices, 
called “Profile Graphs,” for the integrated presentation of 
data of several dimensions. The several types of such profiles 
(two-and three-dimensional, simple and composite, statistical 
and historical) are explained; their advantages and disad¬ 
vantages discussed; and illustrations given for the several 
types. 

T his article is concerned with some graphical devices for the pres¬ 
entation of statistical data of several dimensions. Although the 
device is not new^ we feel justified in discussing it here, for two reasons. 
On the one hand, the method, its considerable merit notwithstanding, 
seems little known or at least, little used.^ On the other hand our analy¬ 
sis of the types of relationships which lend themselves to this kind of 
presentation serves, we believe, to widen the scope of the method. 

We begin with a brief discussion of the nature of multi-dimensional 
relations. By “dimensions” we mean data of distinct denomination 
which combine by multiplication into a further significant datum. A 
two-dimensional relation is thus characterized by three data, one of 
which is the product of the other two: (or inversely, one of 

which is the quotient of the other two: m=A/n); a three-dimensional 
one by four data one of which is the product of the other three: 
and so on. 

Many basic relations in the field of economics, as well as in other 
fields, fall into this class. To enumerate a few: value equals quantity 
times price; payrolls equal number of employees times average pay; 
daily wages equal hourly rates times hours per day; man-days of work 
(or of strike) equal number of workers times average days of work (or 
of strike); passenger-miles of traflic equal number of passengers times 
average mileage, and so on. 

This variety of multi-dimensional relations can generally be reduced 
to a common type. In all such instances the data refer to classes of 
elements pertaining to given time intervals. In two-dimensional rela¬ 
tions the one factor, say n, represents the number of elements in the 

i K. Karsten in Charts and Graphs, New York, 1925, p. 613 ff. describes certain aspects of it under 
the name Area Bar Charts, although, as we believe, without full appreciation of their significance. 

* The present author hit upon a scheme of this kind in trying to give an integrated graphical 
presentation of strike data. See “Strike Profiles,* Journal cJPotilieal Seonomy, Dec,, 1944, pp. 319 ff. 
He has not seen it used elsewhere. 
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class during the time interval in question; (number of unit goods pro¬ 
duced, or sold, or consumed; number of acres planted; number of per¬ 
sons doing a certain thing; number of certain events occurring per 
year, or per month); while the other factor m gives a numerical attri¬ 
bute of the elements (prices of goods, productivity of workers, or of 
acres; duration of actions or of events). The product A^n-m then 
represents the total value of the attribute pertaining to the class of 
n elements as a whole (total value; total product; total earnings; 
total man-days, etc.) 

In most instances, it is true, the attribute m is not constant, but 
varies among the n elements in the dass. In that case the aggregate 
value of A is obtained, not simply by multiplying, but by the summa¬ 
tion of the »?i-values. 


ri. *= m. 

In terms of this aggregate, however, we can define an “average” or 
“mean value” of m; 


_ 1 » A 

m = — 2^mi ’= — > 
n i_i n 

wherefore A can again be presented as a product of two factors, 
=n‘W. 

Three-dimensional relations can similarly be reduced to a common 
type in terms of a class or classy of elements: the whole class in ques¬ 
tion contains p subclasses of q (or ^ elements each, so that the total 
number of elements in the class is n=p-q. If m (or in) is the numerical 
attribute of the elements, then A=‘n-m<=p-q-m is the aggregate 
«i-valuc of the class os a whole. (For example, p acres of average pro¬ 
ductivity q bu/acre produce n=p g bu; with the price M cts/bu, the 
total value of the crop for the given year is A^p-q-M cts.) 

OBAPHICAL BEPBESENTATION OF MUnTI-DIMENSIONAL BEIATIONS 

The customary graphical representation of more-dimensional data 
is by way of multiple graphs: triplets of line or bar graphs in the same 
or in adjacent di^ams for the three data n, m, and A. (For example 
for quantity, price, and value of commodities, or for employment, 
average earnings, and payrolls, and the like.)* 

* Since the viewpoint in sudi presentations does not necessarily coincide with the one taken here 
sadi multiple graphs do not always show all, or not only, the (n+l) data belonging to an n-dimensional 
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Such multiple graphs, however, while showing clearly the data and 
their time variations separately, fail to give visual expression to the 
dimensional relation between them. 

In contrast the profile graphs represmit all the data which enter 
into the relationship in a single, dimensionally integrated figure. 

A. Profiles of two-dimensional relations. To represent two-dimen- 
donal data one plots the two factors n and m respectively* as the width 
and height of a rectangle, whose area then represents the product 
datum A. Constructing such rectangles for successive time intervals, 
and placing them ade by ade, each with its proper “time label,” one 
obtfdns what we call a “statistical profile.” Such profiOle shows in a single 
figure the three time series n(t), m(t), and A{£) “dimensionally in¬ 
tegrated,” the datum A(t) appearing visually as the product resulting 
from the multiplication of the two distinct data n(t) and m(t). 

The economy of this presentation is due to the fact that both axes 
of the diagram are utilized for the two dimensions n and tn of the rela¬ 
tion in question, while none is used as a “time axis.” Chronological 
time is merely represented by the lime label? of the several rectangles. 

Against the advant^es of this compact and integrated presentation 
stand certain disadvantages. First, the ninnerical values of the areas 
are less easily read and compared than in line or bar graphs showing 
the values of A separately. This disadvantage can be offset largely by 
writing the values of A inside the rectangles. (See Fig. 1.) Secondly, 
the time variations, not only of the areas, but also of the widths of the 
rectangles stand out less clearly than on graphs showing each variable 
separately. Again, the profiles are at a disadvantage in comparison to 


rdafaonship. Thus the Bureau of Labor Statistics shows cuatomarily in one diagram the indices of em^ 
ployment and payrolls, without the implied average earnings, while the National Industrial Conference 
Board {StudUs of American Wages) shows in one diagram the indices of employment and average earn¬ 
ings, without the total payrolls Likewise Babson’s Bubinesa Reports shows in multiple line graphs the 
quantities and orices of commodities, but not the implied total values; whereas other presentations will 
show doublets of line or bar graphs for quantity and total values (e.g. of crops), without the implied 
price. Again one finds multiple graphs of data belonging not to one. but to several two-dimensional 
idations. Thus the N.I.O.B. in its Weekly Chart Service, Road Maps to Industry, shows in one single 
diagram 15 line graphs of manufacturing indices, all of which can be grouped into triplets of two-di¬ 
mensional data. On the other hand these and similar publications also show the exact triplets of two- 
dimensional data. For instance, the Road Maps to Industry show triplets of line graphs for indices of 
volume, price, and value of goods sold; of payrolls, employment, and average earnings; of number of 
strikes, workers involved, and average workers per strike, etc. Similarly in the Survey of Current Busi¬ 
ness (U. S. Department of Commerce) triplets of line graphs appear in adjacent diagrams for weekly 
hours, hourly earnings, and weekly earnings; for quantity, price and value of production. Since the 
latter part of 1937 the N.I.C.B., loe. cit., separates the 15 indices of Manufacturing Changes into two 
groups called respectively Basic and Derived Indices. In our designation the former are the data n and 
A, the latter the data in »A/n. While the making of this distinction draws attention to the dimensional 
rdation between the data no visual expression is given to them in the graphs. 

«In the foUowmg we axe using m as syxmnymous with M, unlesB the distiaction is essentiaL 
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BITUMINOUS COAL PRODUCTION IN THE UNITED STATES, 1900-39 
FIVE-YEAR AVERAGES; INDEXES, 1929 «100 
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Source: National Bureau of Economic Research, Barger and Schurr, 
The Mining Industries, 1899-1939, Table A-8. 



Note: Since the data presented in this figure vary on the whole but slowly a year-to- 3 ^r profile would 
be impractical. But by presenting five-year averages a distinct and telling picture of the development 
as a whole results; the steadily inereasing productivity per man-hour, combined with an imtially strong 
increase, and later decrease of man-hours, resulting, after a prolonged increase, in an eventual sharp 
decrease in total output. 


line graphs whenever the time variations of the data are only small per¬ 
centages of the whole.® However, even under such circumstances it is 
often possible to obtain worthwhile profiles by constructing the several 
rectangles for larger time intervals. The objection has also been made 
that profile graphs are difficult to understand. This objection, we be¬ 
lieve, can best be met by proper legends on the graphs showing clearly 
the meaning, units of measurement, and scales for all the data involved. 

To siunmorize, the profiles are useful insofar as the advantages of ^ 
the dimensional integration outweigh the several disadvantages. More¬ 
over, one can always show in separate diagrams any one of the variables 
which docs not stand out with desired clarity in the profile. 

B. Proflies of Ihree-dirrunsmud reVdiom. For relations of the type 
A^Tp-q-m one can utilize the third space dimension and plot p, q, 
and m respectively as the length, width, and height of rectangular 
solids, whose volumes then represent the product A. In addition the 


s In a line graph one can show the smallest percentage changes as large as one pleases by simply 
using a large enough scale, and placing the sero level info the unseen depths below the diagram. In 
a profile, however, where the whole amounts from sero up are repreeented, peroeutage variations can¬ 
not be made to look large when in fact they are small. Hence in such situations, where a line graph may 
be seen to rush dramatically up and down, a profile would offer the drab sight of a sequence of practi¬ 
cally equal rectangles, ginng, it is true, the correct information as to practically unc h ang in g values, 
but unable to exhibit to the eye the small variations which do take place* 
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axeas of the faces of the solid show the partial products p-q, q-m, 
and p-m. Of these the first, and frequently also the second are them¬ 
selves significant data, p-q=^na& the total number of elements in the 
whole class, and q-m as the aggregate m-value per sub-class. 

This presentation is very compact, being capable of showing up to 
six significant data dimensionally integrated in a single figure. (See 
below Fig. 3.) On the other hand the disadvantages noted in the two- 
dimensional profiles pertain a fortiori to the three-dimensional ones, 
since the volumes of solids arc still less readily compared than the 
areas of rectangle. Hence a separate presentation of the product A 
may often be desirable. 

To avoid these somewhat cumbersome solids one can represent three- 
dimensional relations in merely two space dimensions, by means of 
“ruled rectangles.” We first construct a rectangle whose width is the 
partial product n=p-q. With m for its height its area represents the 
three-dimensional product A=p q-m. Next we divide the width n 
into a number of segments equal or proportional to p, whereby each 
segment has a length proportional to n/p=q. At the points of division 
we rule the rectangle by parallel lines ioto strips, whose T^ddth is thus 
proportional to q. 

Such “ruled profiles” exhibit, properly integrated, all the significant 
data entering into three-dimensional relations. Although likely to be 
less accurate in the presentation of the factors p and g, they have the 
advantage of being more easily read and interpreted than the more 
complex solid figures. 

Figures 1 and 2 show examples of profiles of two-dimensional rela¬ 
tions; Figure 3 of a solid, and Figure 4 of a ruled profiOie of three-di¬ 
mensional relations.* 

C. Composiles.’’ The profile method appears to be of special advan¬ 
tage where the question is to represent not merely the data of a class 
(n, m, A) or (p, g, m. A) as a whole, but the distribution of these data 
over a number of component classes into which the whole class is di¬ 
vided. 


* Amonic the many other data which can advantageously be presented as two-dimensional profiles 
we mention: volume of crops (“-acreage Xbu/acre>; earnings of certain classes of wage eamers (“-work¬ 
ing timeXwage rate); workers involved in labor board cases (-^number of cosesXworkersXcase); 
passenger-miles or tor-miles of traffic («number of passengers or tonsXaverage mileage). Of three- 
dimensional relations annual man-hours in an industry ( -“average employment Xdaily hours Xaverage 
working days in the year); annual earnings ( -“daily hours Xwage rate Xaverage working days in the 
3 rear.) Both these are of great interest in connection with the question of a guaranteed annual wage. 

7 The figures here described as *oompo«tcs’' are the only type with which Karsten (op. eif.) 
illustrates his "Vertical Area-Bsr-Charts”; neither the two and three dimensional, nor the ruled time- 
series profiles are given or described. 




Note: This profile integrates the simultaneous influences of the three separately varsring dimensions: rapidly expanding acreage (width), and increasing productivity 
per acre (depth), together resulting in rapidly increasing total crops (shaded top faces); and abruptly fluctuating price (height), giving rise at times to smaller values 
per acre (shaded lateral faces), but steadily growing total values (volumes of solids). 



FIGURE 4 


EMPIX)YMENT AND EARNINGS IN TOBACCO MANUFACTURING 
UNITED STATES, 1899-1939 

Agsregftte Wages Number of Wage Earners XAverage Annual 
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Source: U. S. Census of Manufactures. 1939. II. 1, p. 270 

Note: In this “ruled profile” several employment data pertaining to the tobacco industry are shown 
dimensionally integrated: total number of wage earners, (width), total wages (area); average annual 
earmngs (height) number of establishments to nearest thorsand (number of internal strips); average 
number of workers per establishment (width of strips). The profile thus shows at a glance the develop¬ 
ment from most poorly paid production In many very small establishments towards larger scale pro¬ 
duction in ever fewer establishments, employing m toto fewer workers at higher, but still very low 
average wages. 


The customary presentation of a distribution of data over com¬ 
ponent classes is dther by way of compound bars; or by circular charts 
showing the share of each component as sectors of the full circle; or, 
where the time variations of the several components are of principal 
interes t, by way of compound line graphs showing the components 
added vertically onto each other. To present in such manner the trip¬ 
lets of data of a two-dimensional relationship distributed over lb com¬ 
ponents one would need three separate lb-fold compoimd line or bar 
graphs; or pairs of circular charts, for the presentation of the n and A 
data, combined with a bar graph for the m data. And at that these mul¬ 
tiple compound graphs would fail to show the dimenrioual interrela¬ 
tionships between the particular triplets (n^, mt, At) bdonging to one 
same component class. 

Under the dimensional scheme one combines such triplets for each 
component class into one rectangle, and places all k such component 
rectan^es side by side. The figure so obtained eonristing of k com¬ 
ponent rectangles all of which belong'to one conunon time interval, 
we call “compoffltes,* (in contrast to the “profiOies* which consist of 
rectangles belon^ng to successive time intervals). 
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In composites the disadvantages of the dimensional presentation 
are far smaller than in the profiles, because they concern a distribution 
of values at a given time only, not their variation in time. On the other 
hand the advantage of showing all k triplets of data in their proper 
dimensional relationship in a single figure is very considerable. 

Finally one can construct “composite profiles,” that is, sequences 
of composites for successive time intervals, which are capable of 
^ving a very compactly informative view of the chan^ng aspects of 
complex phenomena.’ 

The three inures below show illustrations of various types of com¬ 
posites; Figure 5 of a simple distribution of two-dimensional data; 
Figure 6 of a special kind, which we call “frequency composite”; 
Figure 7 of a sequence of “ruled” composites.* 


FIOTTBB 6 

RAILROAD TRAVEL, BY CLASSES OF PASSENQERS 
UNITED STATES, 1941, 1943 

l^enger>Miles ^Passengers XAverage Mileage 
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A. Commuters 
B Coach Passengers 
C. Parlor. Sleeping Car Passengers 

Source: Statistics of Kailwaya in the IT. S. 1943. Tables 44. 52 

Note: This figure shows data for railroad traffic: number of passengers (width), average 
(height), and passenger-miles (area) distributed over three classes of passengers. The several influences 
of wartime conditions are seen, dimensionally integrated: the increased number but constant 
of the commuters; and the increase, both in number and mileage of actual travelers. 
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* See "Strike Profiles.” loe. cH . figs. 2, 3. 4, 5. 

* Other data which can effectively be presented as composites are; volumes of crops (acreage 
Xbushel/acre) distributed over various geographical regions, states, or countries; values of crops 
(acreage Xvalue per acre) distributed over various crops; payrolls (workers Xaverage pay) distributed 
over different industries, or different classes of personnel; traffic data, as in Figure 6. distributed over 
various m ea n s of transportation (railroads, airplanes), and many others. 
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FIGURE 6 

CONSUMERS. INCOME, BY INCOME LEVELS, UNITED STATES 1935 
Aggregate Income ^Consumer Units XAverage Income 
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Gouico: U. S. Statistical Abstract, 1942, Table 384. 
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Note: The special feature in this *fmqueney composite” »that the component classes (income levels) 
are simply magnitude classes of the dimension (average income) itself. Consequently the data con¬ 
stitute essontiplly a frequency distnbution, the width (n») showing how many elements have lvalues 
in the interval --iiK..}). (the heights of the two adjacent rectangles). However this composite 
shows additionally also for each income level the aggregate income (area X^), and the average income 
(height aU), and thus gives more information than ordinary frequency graphs, simple and cumulative. 


D. Historical Profiles. Many apparently one-dimensional line or bar 
graphs of time series are really two-dimensional profiles, althot^ of 
a different kind than those discussed above. 




PIG, 7. STRIKES BY ISSUES, UNITED STATES, SELECTED YEARS 
Man-Days Idle *» WorkeiB Involved XMean Duration of Idleness 
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As an example let us take a lino graph showing as its ordinate y a 
labor force varying in time t plotted along the horizontal time axis, 
(Fig. 8). Let Lt, in days, be a time interval during which y is sufficiently 
constant. Then the product y-A2=AA, which is the strip of area under 
the curve standing over M, gives the number of man-days of employ¬ 
ment. Hence the total area under the curve between two given dates U. 
and ti, 

h 

A »=» ^y'At 

h 

gives the total of man-days of employment during the time ft—fi. 
The figure, therefore, is a profile showing three dimensionally related 
data: the number y of workers employed at any time t; the duration Af 
of cmplo 3 ancnt of y men, and the man-days of employment A. 

In other instances the height y of a line or bar graph represents, not 
as in the above case, a magnitude which endures in time, but a number 
of occurrences taking place and accumulating per time unit, that is, a 
time rate, AiV /At (c.g. the number of births per year; or the number 
of strikes per month; or the number of workers quitting their jobs per 
month, or per week, etc.). In such cases the area under the curve 

h h AAT e 

A = 2 “ 2 -=■ £ ^ ^ 

*i ii Af (I 

represents the total number of occurrences during the pmod ft—fi. 
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Hence again the graph is truly a profile representing three dimensionally 
related data: the time rate the time Ai over which the rate prevails, 
and the total number of occurrences during the time interval in ques¬ 
tion.^® 

Profiles of this kind, in which chronological time is represented by a 
time axis, we call ‘‘historical profiles,” in contrast to the statistical pro¬ 
files discussed above. 

A special form of historical profile results when the phenomenon 
represented is intermittent rather than continuous, that is, occurs at 
certain times, lasts for a while, and ceases again. The profile then con¬ 
sists of separate, quasi-rectangular figures along the time axis, their 
place showing the time of occurrence, their height the “magnitude,” 
their width the duration, and their area the “time integral” of each 
particular occurrence. 

Profiles of this kind are useful when individual occurrences rather 
than statistical summaries are important. For example, where indi¬ 
vidual strikes rather than overall strike statistics are of interest—as 
are, indeed, from the standpoint of management the strikes in its own 
establishments—“historical strike profiles” could be used to con¬ 
siderable advantage. Any one strike would be sho^vn at its proper place 
along the time axis by plotting vertically the day-to-day number of 
workers involved, whereby the area would give the number of man- 
days idle. Such profiles show at a glance when, how often, and for how 
long how many men were out, and how many man-days were lost.^^ 

On similar principles, only on a time scale ranging over centuries 
rather than months and years, one can construct very instructive “his¬ 
torical war profiles,” for individual nations or groups of nations, by 
using for the height the absolute or relative “number of men involved,” 
that is, the size of the armies, as a rough yardstick for the magnitude of 
wars (at least in the pre-atomic era), while the width of the rectangles 
shows the duration, and the areas the “man-years” of wars. 

10 These dimensional ndations are familiar in the field of physics: Two examples analogous to the 
situations discussed above are: when a force F is plotted as a function of time t, the area under the 
curve, represents the Impulse; and when a velocity v ^ds/dt is plotted as a function of time 
the area under the curve/f^edt represents the total distance st—si traversed during the time interval 
t»—-ti. 

u To render such profiles more informative one could show further, say by distinctive coloring, 
the principal causes of the various strikes; through shaded extensions of the stnke rectangles one could 
show the number of workers made indirectly idle through the strike. To allow the representation, in 
one same diagram, of large and of small strikes the vertical scale could be made logarithmic. 



A METHOD FOR OBTAINING AND ANALYZING 
SENSITIVITY DATA* 


W J Dixon 
Umveriitty of Oregon 

AND 

A M. Mood 
Iowa State College 

The slandaid method of deahng with sensitivity of dosage- 
mortahty data is the probit techmquc developed by Bhss and 
Fifaher. This paper provides an alternative tcchmque based on 
a special system for obtaimng sueh data It has some ad¬ 
vantages when observations must be taken on individuals 
rather than groups of individuals, and it may bo preferred in 
ccitain other situations 

INTRODUCTION 

E xperimental investigations often deal with continuous variables 
which cannot be measured in practice For example, in testing the 
sensitivity of explosives to shock, a common procedure is to drop a 
weight on specimens of the same explosive mixture from various 
heights. Theie are heights at which some specimens will explode, and 
otheis will not, and it is assumed that those which will not explode would 
explode were the weight dropped from a suflBiciently greater height. It 
is supposed, therefore, that there is a critical height associated with 
each specimen, and that the specimen will explode when the weight is 
dropped from a greater height and will not explode when the weight 
is dropped from a lesser height. The population of specimens is thus 
chaiaclciized by a continuous vaiiable—^the critical height—^which 
cannot be measured. All one can do is select some height arbitiarily 
and del ei mine whether the critical height for a given specimen is less 
than or gi eater than the selected height. 

This situation aiises in many fields of research. Thus in testing insec¬ 
ticides, a ciitical dose is associated with each insect, but one cannot 
measure it. He can only try some dose and observe whether or not 
the insect is killed, that is, observe whether the ciitical dose for that 
insect is less than or greater than the chosen dose. The same difl&culty 
arises in phaimaceutical research dealing with germicides, anesthetics, 

* This paper la in part an adaptation of a momoiandum submitted to the Applied Mathematics 
Panel by the Statistical Research Group Fkinceton University The Statistical Research Group oper¬ 
ated under a contract with the OfGice of Scientific Research and Development, and was directed by the 
Applied Mathematics Panel of the National Defense Research Committee. 
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and other drugs; in testing strength of materials; in psycho-phydcal 
research dealing with threshold stimuli; and in several areas of biologi¬ 
cal and medical research. 

In true sensitivity experiments it is not possible to make more than 
one observation on a given specimen. Once a test has been made the 
specimen is altered (the explosive is packed; the insect is weakened) 
so that a bona fide result cannot be obtained from a second test. The 
common procedure in experiments of this kind is to divide the sample 
of specimens into several groups (usually but not necessarily of the 
same size) and to test one group at a chosen level, a second group at a 
second level, and so on. The data consist of the numbers affected and 
not affected at each level. A method of analyzing such data (variously 
called “senativity* data, “all or none" data, “quanta! responses") has 
been developed by Bliss and Fisher [references 1, 2], and discussed by 
other writers [3, 4, 5, 6]. 


A new technique for obtaining senativity data has been developed 
and used in explosives research. The authors became acquainted with 
this new method in 1943 at the Exploaves Research Laboratory, Bruce- 
ton, Pennsylvania. It has come to be called the “up and down” method. 
The method may be employed in any sensitivity experiment, but we 
shall discuss it in terms of the explosives to avoid general terminology. 

The technique is to choose some initial height ho, and a succession 
of heights hi, ht, As, - ■ * above ho together with a succession h-i, A_s, 
h-o, • ■ * below ho. The first specimen is tested by dropping the wei^t 
from height ho- If the first specimen explodes, the second specimen will 
be tested at h_], otherwise the second specimen wiU be tested at hi. 
In general, any specimen will be tested at the level immediately below 
or immediately above the level of the previous test according as there 
was or was not an exploaon on the previous test. The result of such an 
experiment might be portrayed as in Figure 1 where the z’s represent 
explosions and the o’s non-explosions. The first test is on the left at the 
hipest level; this was a success (explosion) so the second test was made 
at the next lower levd and was also a success; the third test was there¬ 
fore made at the level below that of the second and since it was a failure 
the fourth test was made at the level above that of the third test. 

The primary advanta^ of this method is that it automatically con¬ 
centrates testing near the mean. We shall see later that tlus increases 
the accuracy mth whidi the mean can be estimated. Or in other words, 
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BEOORD OF A SAMPLE OF SIXTY TESTS 

Normalised 

Height 


Number of 
x’s o*s 


2 0 X 1 

17 XXX XXX X XXX 10 

14 oxox xxxxooxoxxxoxxxoxxoxxox 18 9 

11 O XOOOOO 0 000 000 00 00 X0 2 18 

0 8 o o 2 


FIGUBE 1 

for a given accuracy the up and down method will require fewer tests 
than the ordinary method of testing groups of equal size at preassigned 
heights. The saving in the number of observations may be of the order 
of 30 to 40 per cent (see Appendix A). 

Another advants^ is that the statistical analysis is quite simple in 
certain circumstances whereas the analysis for the ordinary method is 
rather tedious. 

The method has one obvious disadvantage in certrun kinds of experi¬ 
ments because it requires that each specimen be tested separately. This 
is not important in explosives experiments because each test must be 
made separately anyway. But in tests of insecticides, for example, a 
large group of insects can sometimes be treated as easily as a angle 
one, and in largo experiments of this kind any advantage of the up and 
down method might well be outwdghed by this requirement of single 
tests. Even here, if expensive laboratory animals were being used, the 
advantage in economy of tests might offset the trouble of making angie 
tests. 

CONDITIOira ON TEN KXFEBIMENT 

The statistical analysis of data obtained can be quite simple provided 
the cxpcrimmrt satisfies certain conditions. Loss restrictive conditions 
must be fulfilled in order that any analyas will be possible. These will 
be discussed here and the actual analysis will bo given in the following 
section. 

In the first place, the anal 3 >^ reqmres that the variate under 
analyas be normally distributed. In practice the variate of interest to 
the research worker can rarely be conadered to be normally distributed. 
It is therefore necessary that the natural variate be transformed to 
one which does have the normal distribution. This is readily done pro¬ 
vided the research worker has enough experience and data on his ma¬ 
terial to be able to specify rather accurately the shape of his distribution 
function. It is often the case in dosage mortality experiments and in 
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Kcperiments on explosives that the logarithm of the dosage concentra¬ 
tion or of the height is reasonably normally distributed. But in other 
areas of research, and sometimes in these areas, other transformations 
are more appropriate [7]. 

If one has no idea of the shape of his distribution function then the 
data of the experiment itself must be used to provide this information. 
The common procedure here is to compute the percentage affected at 
each level and plot these percentages on arithmetic probability paper 
against various functions of the variate in question. Usually one can 
soon discover what sort of function will force the percentages to lie 
sensibly along a straight line. There are, of course, infinitely many 
functions to choose from; the chosen fimction should be as ample as 
possible consistent with whatever knowledge is available concerning the 
nature of the material at hand. 

We have already mentioned that the up and down method is par¬ 
ticularly effective for estimating the mean. It is not a good method for 
estimating small or large percentage points (for example, the height at 
which 99 per cent of specimens explode) unless normality of the dis¬ 
tribution is assured. In fact no method which uses the normal distribu¬ 
tion can be relied on to estimate extreme percentage points because 
such estimates depend critically on the assumption of normality. In 
most experimental research, it is possible to find simple transformations 
which make the variate essentially normal in the region of the mean, 
but to make it normal in the tails is quite another matter. Nothing 
short of an extensive exploration of the distribution involving perhaps 
thousands of observations will suffice here. Bartlett [S] has recently 
presented an interesting technique for dealing with this problem. 

A second condition on the experiment is that the sample size must be 
large if the analysis to be described is to be applicable. As it turns out, 
the effective sample size is only about half the actual sample size. The 
statistical analysis is based on large sample theory so that if one uses 
the analyris on a sample of size forty, he will in effect be using large 
sample theory on a sample of size twenty. Measures of reliability 
may well be very mislea<^g if the sample size is less than forty or 
fifty. 

A further condition is necessary if the statistical analysis is to be 
simple. One must be able to estimate roughly in advance the standard 
deviation of the normally distributed transformed variate. The inter¬ 
val between testing levels should be approximately equal to the stand¬ 
ard deviation. This condition will be well enough satisfied if the inter¬ 
val actually used is less than twice the standard deviation. This require- 
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ment is not severe, for research workers who repeatedly perform these 
requirements on essentially similar materials can usually make very 
good preliminary estimates. This is the case in explosives research or 
biological assay, for example. This circumstance (of repeated experi¬ 
ments) is precisely the one in which a simple analysis is most desirable. 

STATISTICAL ANALYSIS 

The simple method of analysis given in this section is applicable only 
when all the conditions described in the preceding section are fulfilled. 
The theory underlying the method is given in Appendix A. The more 
complex analysis required when the levels are not equally spaced or 
when the distance between levels exceeds twice the standard deviation 
is ^ven in Appendix B. 

We again revert to the explosives experiment in describing the meth¬ 
od. Suppose it is known for the given type of explosive that the log¬ 
arithms of the critical heights arc normally distributed. Letting h 
represent the height, log h will then be the normally distributed 
variate. We shall call y the normalized height, and represent the mean 
and variance of its distribution by n and <r®. The experiment is per¬ 
formed by choosing an initial height for the first test, say Ao. This 
should be chosen near the anticipated mean. The other testing levels 
are determined so that the values of the normalized height y are equally 
spaced. If d is the preliminary estimate of <r, and if yo= log Ao, then the 
actual testing heights are obtained by putting log h=yn±d, yo±2d, 
yo±3d, • • •, and solving for A. The heights will then be so spaced 
that the transformed variate is equally spaced with spacing equal to 
its anticipated standard deviation. AU computations are done in terms 
of y. 

In any experiment the total number of successes will be approxi¬ 
mately equal to the total number of failures. In fact, the numW of 
failures at any level cannot differ by more than one from the number of 
successes at the next higher level. For estimating y, and v only the suc¬ 
cesses or only the failures arc used, depending on which has the smallor 
total. In the example shown in Figure 1 there are fewer failures than 
successes so the failures would be used. We shall let N denote the 
smaller total and let no, ni, nj, • • • n« denote the frequencies at each 
level for this less frequent event where no corresponds to the lowest level 
and n* the highest level on which the event occurs. We have then 
Xn,—N. 

The estimates of y and tr are based on the first two moments of the 
y values using the frequencies n,. But since the y values are equally 
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spaced, the moments are more easily computed in terms of the two 

8UIDS 

il = 23 tn* 

B == 

In this notation, the estimate of pt, say m, is 

where is the normalized hdght corresponding to the lowest level on 
which the less frequent event occurs. The plus sign is used when the 
analyas is based on the failures, and the minus sign when it is based 
on the successes. 

The sample standard deviation is 

/NB - A* \ 

8 » 1.620d(- — -+ .029j (2) 

and this, of coitrse, is the estimate of or. This is a curious estimate in that 
while it is a linear function of {NB—A^)/1P, it gives the estimate of 
the standard deviation, not the square of the standard deviation. The 
formula is an approximate one which is quite accurate when {NB 
—is larger than 0.3 but breaks down rapidly when {NB—A^/N^ 
becomes less than 0.3. In the latter instance the formula cannot be 
used and the more elaborate calculation described in Appendix B must 
be employed. 

The example of Figure 1 will illustrate the use of the formulas. Here 
the y values used were 2,1.7,1.4,1.1, 0.8; the level of the first test 
bdng 2, and d beii^ 0.3. Among the ^ty tests there were 31 explosions 
and 29 failures, hence the latter are used to estimate the parameters. 
The failures appear on three levels (0.8, 1.1, 1.4) with frequencies no 
=2, ni=18, nj=9. We have then N—2Q, A =36, B=54, so that the 
mean is 


m = 0.8 + 0.3 
and the standard deviation is 


/36 1\ 

{ —+ —) = 1.32 
\29 2/ 


/270 \ 

(1.620) (0.3) (— + .029j = .17. 
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The sample was actually drawn from a normal population with /»=1.3 
and <r<=0 2 using Mahalanobis’ [9] table of random normal deviates. 
The mean and standard deviation of the sixty observations were 1.312 
and .158 so that it was a faiily representative sample. 

Peicentage points would be estimated by m+ks where k is chosen 
from tables of the normal deviate to give the dedred percentage. Thus 
in the example, the 5 per cent point is estimated by 

1.32 - (1.645) (.17) = 1.04. 

If the y values are thought of as logarithms to base ten of actual 
heights in inches in an explosives experiment, the antilogarithms of 
estimated percentage points would be estimates of the corresponding 
points for the distiibution of h. Thus the median (not mean) value of 
h is estimated by 

antilog 1.32 => 20.9 inches 
and the 5 per cent height by 

antilog 1.04 — 11 inches. 

The antilogarithm of s does not estimate the standard deviation for 
h, however, and any computation which involves the standard devia¬ 
tion (estimates of percentage points, confidence limits) must be done 
in terms of the normalized height, and only the final result transformed 
to actual heights. 


CONFIDENCE! INTERVALS 

Ordinarily the standard deviation of a sample mean, m, is given by 
<rm=<r/-\/N where <r is the population standard deviation and iV the 
sample size. In the present case this expression must be multiplied by 
a factor which we shall call G, so that the formula for the standard 
error of the mean is 

=■ Off/y/W (3) 

and G depends on the ratio d/v and on the position of the mean relative 
to the testing levels. G is plotted in Figure 2 as a fimction of d/<r. 
The position of the mean relative to the testing levels does not affect 
G unless the interval d is large; the solid branch of the curve gives the 
value of G when the mean falls on one of the testing levels, while the 
dashed branch gives the value when the mean falls midway between 
two levels. Curves for other positions of the mean would fall between 
the two branches. 



116 


AKEBICAK STATISTICAL ASSOCIATION 




r I a V R K a 


o 1 a 3 * a/r s 

In practice a is not known and s must be used in (3) to obtain an 
estimate say, of (Tm. In the illustrative example with s=.17, we have 
d/s^il.S so that G is about 1.12. The estimate of am is therefore 

s„ = (.17)(1.12)/V^ = .035. 

A confidence interval for m may now be estimated by m±&Sm. Thus a 
95 per cent confidence interval is 

1.32 ± (1.96)(.035) or 1.25 to 1.39 

using large sample theory. For moderate values of N, it might be 
preferable to use the value of h given by the t distribution for N—1 
degrees of freedom, but it is likely that this is a minor matter relative 
to the error of using large sample theory for moderate values of N. 
Again assuming the confidence interval refers to the logarithm of an 
actual height, it gives rise to an asymmetric 95 per cent confidence 
interval fl8 to 25 inches) for the median height. 

The standard error of the sample standard deviation, say or,, is or¬ 
dinarily given by af-s/^, but in the present analysis an additional 
factor is again required. We shall write 

(T. = Ha/VN (4) 

where we have incorporated the 1/v^ into the extra factor. H is 
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plotted in Figure 2 where the solid branch gives the value of H when the 
mean falls on a level, while the dashed branch gives the value when the 
mean is midway between two levels. When d/<r is less than two there 
will be little error introduced by interpolating linearly between the 
two branches for other positions of the mean. Thus if the mean falls 
d/4 from a testing level, one may use the value of G midway between 
the two branches. For the illustrative example with d/s = 1.8, we find 
H to be about 1.24 so that the estimate of <r* is 

s. = (L24)(.17)/\/^ = .039. 

The estimate s« would be used to estimate the standard error of a per¬ 
centage point, m+fcs; the estimate would be Vsm^+iW. Thus in the 
example, a 95 per cent confidence interval for the 6 per cent point 
would be estimated by 1.04±(1.96)\/(0 35/+(l 645;2(.039)2 or 88 to 
1.20. We should mention again that the estimation of small or large 
percentage points depends strongly on the assumption of normality in 
the tails. It can easily happen that a relatively small error in this as¬ 
sumption may far outweigh the sampling error indicated by the con¬ 
fidence inteiwal, especially in the case of very extreme percentages, say 
1 per cent or 0.1 per cent. 

CHOICE OF TESTING INTEUVAL 

The curves in Figuie 2 have been extended beyond d=2cr in order 
to show what happens to the measures of precision for larger intervals. 
Curve G shows that the precision of the mean steadily decreases as d 
increases. The two branches of H show that there is an optimum spacing 
for estimating the standard deviation depending on the position of the 
mean ndativo to the testing levels. Since the mean is usually unknown, 
this infoiniation is of little practical value. 

C^irve 0 indicates that the interval should bo quite small for maxi¬ 
mum pieeision in the mean, but in practice this is not true for several 
reasons. In the first place the curves arc for expected values and essen¬ 
tially assume infinite sample sizes, and in fact very large samples are 
required to get good estimates of the mean for a very small interval. 
The estimate may be biased appreciably towaid the initial testing level 
unless the sample is very large. Secondly, a small interval may cause 
one to waste observations unless a good choice for the initial level is 
made. If a poor choice is made, many observations must be spent get¬ 
ting from that level to the region of the mean. And finally, since or is 
usually unknown, the precision of the mean must actually be measured 
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by s, and the accuracy of s becomes poor for very small intervals as 
shown by curve ff. 

All these condderations indicate that the interval should be within 
the range of about 0.5<r to 2<r, and experiments with the method support 
this conclusion. 


AfPENBIX A 

If y is normally distributed with mean /t and variance and if tests 

are made at 


y6± id t » 0,1, 2, • • • (6) 

where yt is the level of the initial test, then there will be, say, suc¬ 
cesses and m failures at yi, and the distribution of these latter variates 
is 


P(n, ot| yo) = n 


( 6 ) 


where 


Pt = I e ^ dt^l-Qi (7) 

J-t, v2ir<r 

and where K is not a function of ft and 
The estimation of ft and is based on the principle of maximum like¬ 
lihood. We shall not maxiniize (6) directly, however, because a material 
simplification in the anal}^ can be made by neglecting a small part of 
the information in the sample. It is clear that 

I tii — mi-i\ = 0 or 1 

so that either one of the sets (n^ or (mi) contain practically all the 
information in the sample. If N^’Snt and and nfMiiming 

N^M,-we may write (2) in the form 

P(n, ml yo, M - N) = K'JJ (p.fff-i)** (8) 

i 

and this is the expression which will be maximized. Even if M—N is 
not small, only a small amount of information is bring ne^ected, be¬ 
cause in this instance the initial level will have been poorly chosen and 
these neglected observations will have been spent in getting from y, 
to the re^on of the mean; they will obviously contribute little to the 
more precise location of the mean. 
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Od putting the derivatives of (8) 'with respect to n and v equal to 
zero we have the relations 



where z,- represents the ordinate of the distribution of y at yi and 
Xi=(y,—it)/<^- The expected values of the left hand sides of these 
two expressions are readily found to be zero on substituting E(n,) for 
rii. E(ni) may be determined from the relation 


If we let 


then it foUo'ws that 


Ejnt+O ^ 
ff< Pi 


too ^ 1 


*-l 

w. “ n 

& 

t >0 

i-o 

Vi 


i 

-n 


* < 0 


is 



/ “ 


F(n,) = Nwi ^ 

E 

—*00 

w.. 


( 11 ) 


( 12 ) 


The maximum likelihood estimates of ft and a are the roots, say /t 
and of equations (9) and (10). While there is no simple closed ex¬ 
pression for these roots, it turns out that they can be very closely 
approximated when d<2<r. The function 

r g(a:) z(x + d/«r) 

“ ^ ?(») P(» + d/<r) 

where u=^x+di2<r, is nearly linear in « when d<2<r. This is illustrated 
in Figure 3. Similarly 

xz(x) (x + d/<i)z{x -f d/a) 

^ ^ 9(*) P(* + d/a) 

is nearly quadratic in tt as is indicated by the graph of its first deriva¬ 
tive in Figure 3, where f^d/a. We may conclude therefore that the 
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• (u) 






0 


® X 2 3 u * 


estimates are essentially determined by the first two moments of the 
yi using the as weights. 

If we let 



1 

2 ntyi, Ai “ ^ 


(13) 

we find 







•B(A0 “ M 


(14) 

and 



- 



EiHt) — 

TOO +dV4 = V* 

2 

(16) 


The expression on the ri^t of (15) is nearly linear in c when d<2<r, 
and its linear approximation was used to determine the estimate of c 
given in equation (2). The function is plotted in Figure 4; the solid 
branch represents the function when the mean falls at one of the yi, 
and the dashed branch when the mean falls midway between two levels. 
The two branches diverge rapidly as d becomes larger than 2v. 

The variance and covariances of ^ and t are determined from the 
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second derivatives of L=log P where P is defined by (8). The ex¬ 
pected values of the derivatives are readily found to be 



Expression (17) does not vanish unless the mean falls on a level or 
midway between two levels. However, we have regarded the covariance 
as being negligible for aU practical purposes. It gives rise to a maxi¬ 
mum correlation between ^ and & of the order of .0002 when d=<r, and 
.02 when d=2<r. We have then 

v,,* = GVVW, <r,* = (19) 

approximately, where G and H are defined in (16) and (18). These are 
the functions plotted in Figure 2. 
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It is not possible to make a very satisfactory comparison between 
this method and the ordinary probit method, but the following compu¬ 
tations provide some indication of the relative efiSiciencies. Suppose 2N 
individuals are divided into five equal groups and tested at y^O, 
±cr, ±2<r. Bartlett [8], for example, shows that the variance of ^ for 
the probit analysis is about 5(.564)o^/2N, whereas for the up and down 
method the variance is a^/N which when divided by the former value 
gives 71 per cent. When the sample is tested in six equal groups at 
y=±i<f’, ±|ff, ±f<r, the ratio becomes 68 per cent. But these compari¬ 
sons are not fair unless there is conMderable uncertainty as to the gen¬ 
eral location of the mean. If the mean can be located to within, say, <r 
of its true position in advance of the experiment, then the efficiency of 
the probit method can be much improved by uting groups of unequal 
size and testing the larger groups at levds thought to be near the mean. 

APFEimiX B 

When the chosen testing interval is larger than 2<r, or when the inter¬ 
vals are of unequal size, it is necessary to solve equations (9) and (10) 
for M and or. The intervals will be of unequal size, for example, when 
the normalizing transformation is unknown in advance of the experi¬ 
ment and must be deduced from the results of the experiment itself. 
A method of trial and error is probably as good as any other for solving 
the equations. One would first choose preliminary estimates, say m and 
«, of tile roots by using equations (1) and (2) or simply by using guessed 
values. These preliminary estimates would be adjusted until the equa¬ 
tions were satisfied to the desired degree of approximation. The left 
side of (9) will be positive when the trial value of /< is too small, and 
negative when it is too large. The left tide of (10) will be potitive when 
8<^, and negative when s>&. The equation (9) is relatively insensi¬ 
tive to changes in s, while the same is true of (10) for changes in m. 

In order to fatilitate the computations, the accompanying tables of 
e/p (Table I) and g/q (Table II) are provided. For negative values of 
X, p and g are interchanged, that is 

g(x) g(—x) 

pix) qi-x) 

We shall illustrate the computation using the data of Figure 5. The 
normalized heights are .1, .9, 1.5, 1.9 as indicated in the Figure. We 
shall number the levels 0, 1, 2, 3 be^nning with the lowest level, 
^ce there are more successes than failures, the latter are used to deter- 
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min fi the estimates. A preliininary estimate of it may be obtained by 
using the average of the midpoints of the intervals weighted by the 
numbers n.; thus we shall put 

mi = 1/29 [2(1.7) + 26(1.2) + (.6)] 

^ 1 . 2 . 

A rough estimate of a- may be determined by observing that the inter¬ 
val 0.9 to 1.5 appears to contain 26/29 or about 90 per cent of the dis¬ 
tribution, hence we may use 

1.645si ^ 1(1.6 - 0.9) = 0.3 

Si = 0.18. 


Normalized 

Height 

1.9 

1.5 

.9 

.1 


RECORD OF A SAMPLE OP SIXTY TESTS 

Number of 

X*B 0*8 

XX X 3 

xxxzzxxxxzoxxxxzx xzzxxxxxozxx 27 2 

OOOOOOOOOO 00000X000000000 oo 1 26 

o 1 

FIGURE 5 


In adjusting these estimates one might bo tempted to adjust mi first 
by equation (9) and then go to equation (10) and adjust Si by using a 
good estimate of it. It turns out, however, that the job can be done 
much more rapidly by conradering both equations together. The fol¬ 
lowing computational form may be used: 


t 

n» 

A, 



XtZt 



V,) 

a» 

Pt 

"•w. pj 

3 

2 





0 

6 06 

2 

2G 




3.48 

.174 

-9 05 

1 

1 




-.174 

-3.48 

3.48 

0 




■■■ 

0 







2 09 



1.30 


Note that the table is arranged so that the frequencies of either the 
zeros or ®’s will bo entered in the table as though they were a:’s. The 
s 3 nnibol x, represents (h,—mi)/si where hi is the height and mi and Si 
arc the first approximations to it and v. The other computations are 
defined by the column headings. Thus the figure 4.17 at the top of the 
fifth column is obtained as 2(2.084—.000); 2.084 being read from Table 
II at *=1.67, and .000 being the value of z/p at a:=3.89 as shown by 
Table I. The sums, 2.09 and 1.39, of the fifth and dghth columns give 
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the values of the left hand ddes of equations (9) and (10) respectively; 
since both sums axe positive. We conclude that both mi and si are too 
small Using 1.3 and 82«>.19 we repeat the above calculation: 


i 

ni 

hi 

*1 


av^i 


/ aew \ 


Qi 

Pi 


3 

2 

1.9 

3.16 

3.12 


.01 

3.28 

3 

26 

1.5 

1.05 

-5.85 

1.64 

.282 

-9.75 

1 

1 

.9 

-2.11 

-2.47 

-.093 

-5.21 

5.21 

0 


.1 

-6.32 


0 







5.20 



-1.26 


TABLE I 

VALXmS OF t/p 


X 

.00 

.01 

.02 

.03 

.04 

.05 

.06 

.07 

.08 

.09 

0.0 

0.798 

0.792 

0.785 

0.779 

0.773 



0.754 

0.748 


0.1 

0.735 

0.729 

0.723 

0.717 

0.711 


0.699 


0.687 

0.681 

0.2 

0.675 

0.669 

0.663 

0.657 

0.652 

0.646 



0.629 

0.623 

0.3 

0.617 

0.612 

0.606 

0.600 

0.595 

0.589 

0.584 

E S3 

0.573 


0.4 

0.562 

0.556 

0.551 

0.546 

0.540 

0.535 


0.525 

0.519 

0.514 

0.5 

0.509 





0.484 

0.479 

0.474 

0.469 

0.464 

0.6 

0.459 

0.454 

0.449 

0.445 

tss 

0.435 

0.430 

0.426 

0.421 

0.417 

0.7 

0.412 

0.407 

0.403 

0.398 

tiii 

0.389 

0.385 

0.381 

0.376 

0.372 

0.8 

0.368 

0.363 

0.359 

0.355 

0.351 

0.346 

0.342 

0.338 

0.334 

0.330 

0.9 

0.326 

0.322 

EEiO 

0.314 

0.310 

0.306 

0.303 

0.299 

0.295 

0.291 

1.0 

0.288 

0.284 

0.280 

0.277 

0.273 

0.269 

0.266 

0.262 

0.259 

0.255 

1.1 

0.252 

0.249 

0.245 

0.242 

0.239 

0.235 

0.232 

0.229 

0.226 

0.223 

1.2 

0.219 

0.216 


0.210 

0.207 

0.204 

0.201 

0.198 

0.195 

0.193 

1.3 

0.190 

0.187 


ESSi 

0.179 

0.176 

0.173 

0.171 

0.168 

0.165 

1.4 

0.163 



0.155 

0.153 

0.150 

0.148 

0.146 

0.143 

0.141 

1.5 

0.139 

0.137 

0.134 

0.132 

0.130 

0.128 

0.126 

0.124 

0.121 

0.119 

1.6 

0.117 

0.115 


0.111 

0.110 

0.108 

0.106 

0.104 

0.102 

0.100 

1.7 

0.098 

0.097 

0.095 

0.093 

0.092 

0.090 

0.0S8 

0.087 

0.085 

0.083 

1.8 

0.082 

0.080 

0.079 

0.077 

0.076 

0.074 

0.073 

0.072 

0.070 j 

0.069 

1.9 

0.068 

0.066 

0.065 

0.064 

0.062 

0.061 

0.060 

0.059 

0.058 

0.056 

2.0 

0.055 

0.054 

0.053 

0.052 

0.051 

0.050 

0.049 

0.048 

0.047 

0.046 

2.1 

0.045 

0.044 

0.043 

0.042 

0.041 

0.040 

0.039 

0.038 

0.038 


2.2 


0.035 

0.034 

0.034 

0.033 

0.032 

0.031 

0.031 

0.030 


2.3 


0.028 

0.027 

0.027 

0.026 

0.025 

0.025 

0.024 

0.024 

0.023 

2.4 


0.022 

0.022 

0.021 

0.020 

0.020 

0.019 

0.019 

0.019 

0.018 

2.5 

0.018 

0.017 

0.017 

0.016 

0.016 

0.016 

0.015 

0.015 

0.014 

0.014 

2.6 

0.014 

0.013 

0.013 

0.013 

0.012 

0.012 

0.012 

0.011 

0.011 

0.011 

2.7 

0.010 

0.010 

0.010 

0.010 

0.009 

0.009 

0.009 

0.009 

0.008 

0.008 

2.8 

0.008 

0.008 

0.008 

0.007 

0.007 

0.007 

0.007 

0.007 

0.006 


2.9 


0.006 

0.006 

0.005 

0.005 

0.005 

0.005 

0.005 

0.005 


3.0 


0.004 

0.004 

0.004 

0.004 

0.004 

0.004 

0.004 

0.003 

0.003 

3.1 


0.003 

0.003 

0.003 

0.003 

0.003 

0.003 

0.003 

0.003 


3.2 


0.002 

0.002 

0.002 

0.002 

0.002 

0.002 

0.002 

0.002 

0.002 

3.3 


0.002 

0.002 



0.001 

0.001 

0.001 

0.001 


3.4 


0.001 

0.001 



0.001 

0.001 

0.001 

0.001 


3.5 


0.001 

0.001 


0.001 

0.001 

0.001 

0.001 

0.001 


3.6 
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These results show that the roots are bracketed, and good estimates of 
H and ff may be obtained by interpolation between the sum a Inter¬ 
polating between 1.2 and 1.3 using 2.09, 0, -5.20, we find mj=1.23, 
and similarly S 3 = .185. By doing two more calculations similar to the 
two illustrated above, one would verify the third figures in tn and s 
and obtain good estimates for the fourth figures. Here, the results 
to three figures are jra= 1.21 and s = .187. However, the data do not 


table n 

VALUES OF «/a 


X 

.00 

.01 

.02 

.03 

.04 

.05 

.06 

.07 

.08 

.09 

0.0 

0.798 

0.804 


0.817 

0.824 

0.830 

0.836 

0.843 



0.1 

0.863 

0.869 


0.882 

0.889 

0.896 

0.002 

0.909 


0.923 


0.929 

0.936 


0.950 

0.057 

0.964 




0.991 

0.3 

0.998 

1.005 

1.012 

1.019 

1.026 

1.033 

1.040 

IftikJ 

1.054 

1.062 

0.4 

1.069 

1.076 

1.083 

1.000 

1.097 

1.105 

1.112 

1.119 

1.126 

1.134 

0.5 

1.141 

1.148 

1.156 

1.163 

1.171 

1.178 

1.185 

1.193 

1.200 

1.207 

0.6 

1.215 

1.222 

1.230 

1 237 

1.245 

1.253 


1.268 

1.275 

1.283 

0.7 

1.290 

1.298 

1.306 

1.313 

1.321 

1.320 


1.344 

1.352 


0.8 

1.367 

1.375 

1.383 

1.391 

1.399 

1.406 

1.414 

1.422 

1.430 

1.438 

0.9 

1.446 

1.454 

1.461 

1.469 

1.477 

1.485 

1.493 


1.509 

1.517 

1.0 

1.525 

1.533 

1.541 

1.549 

1.557 

1.565 

1.573 

1.581 

1.590 

1.598 

1.1 

1.606 

1 . 6U 

1.622 

1.630 

1.638 

1.646 

1.655 

1.663 

wwm 

1.679 

1.2 

1.687 

1.696 

1.704 

1.712 

1.720 

1.729 

1.737 

1.745 

1.754 

1.762 

1.3 

1.770 

1.779 

1,787 


1.804 

1.812 

1.820 

1.829 

1.838 

1.846 

1.4 

1.854 

1.862 

1.871 

1.879 

1.888 

1.896 

1.905 

1.913 

1.022 


1.5 

1.938 

1.947 

1.955 

1.964 

1.972 

EEH 

1.990 

1.998 

^ ' iityJ 


1.6 

2.024 

2.033 


2.050 

2.058 

2.067 

2.076 

2.084 

E 1 

2.102 

1.7 

2.110 

2.119 


2.136 

2.145 

2.164 

2.162 

2.171 

E 1 

2.188 

1.8 

2.197 

2.206 

2.215 

2 223 

2.232 

2.241 

2.250 

2.258 

2.267 

2.276 

1.9 

2.285 

2.294 

2.303 

2.311 

2.320 

2.329 

2.338 

2.346 

2.355 

2.364 

2.0 

2.373 

2.381 



2.408 

2.417 

2.426 

2.435 

2.444 

2.453 

2.1 

2.462 

2.470 

2,479 

2.488 

2 497 

2.506 

2.515 

2.524 

2.533 

2.542 

2.2 

2.551 

2.560 

2.569 

2.578 

2.587 

2.506 

2.605 

2.614 

2.623 

2.632 

2.3 

2.641 

2.650 

2.659 

2.668 

2.677 

2.687 

2 . 6 % 

2.705 

2.714 

2.723 

2.4 

2.732 

2.741 

2.750 

2.759 


2.777 

2.786 

2.795 

2.805 

2.814 

2.5 

2.823 

2.832 

2.841 

2.850 

2.859 

2.868 

2.878 

2.887 

2 896 

2.905 

2.6 

2.914 

2.923 

2.932 




2.969 

2.978 

2.987 

2.907 

2.7 

3.006 

3.015 

3.024 



3.052 

3.001 


3.079 

3.089 

2.8 

3.098 

3.107 

3.116 

3.126 

3.135 

3.144 

3.153 

3.163 

3.172 

3.181 

2.9 

3.190 

3.200 

3.209 

3.218 

3.227 

3.237 

3.246 

3.255 

3.265 

3.274 

3.0 

3.283 

3.202 

3.302 

3.311 

3.320 

3.330 

3.339 

3.348 

3.358 

3.367 

3.1 

3.376 

3.386 

3.395 

3.404 

3.413 

3.423 

3.432 

3.441 

3.451 

3.460 

3.2 

3.470 

3.479 

3.488 

3.498 

3.507 

3.516 

3.526 

3.535 

3.544 

3.554 

3.3 

3.563 

3.573 

3.582 

3.591 

3.601 

3.610 

3.620 

3.629 

3.638 

3.648 

3.4 

3.657 

3.667 

3.676 

3.685 

3.695 

ES2II 

3.714 

3.723 


3.742 

3.5 

3.751 

3.761 

3.770 

3.780 

3.789 

3.799 

3.808 

3.817 

3.827 

3.836 

3.6 

3.846 

3.855 


3.874 

3.884 

3.893 

3.902 

3.912 

3.992 

3.931 

3.7 

3.940 

3.950 

3.959 

3.969 

3.978 

3.988 

3.097 


4.016 

4.026 

3.8 

4.035 

4.045 

4.054 

4.064 



4.092 


4.111 

4.121 

3.9 

4.130 

4.140 

4.149 

4.159 

4.169 

4.178 

4.188 

4.197 

4.206 

4.216 

4.0 

4.226 j 

4.235 

4.245 

4.254 

4.264 

4.273 

4.283 

4.292 


4.312 
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warrant any more accuracy in the roots than is given by m 3 and S 3 , 
and one would not do the two extra computations. The results in Figure 
5 were obtained by using the same set of observations (with mean 1.312 
and standard deviation .158) as was used to obtain the results of 
Figure 1. 
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A SHORT-CUT METHOD OF FITTING 
A LOGISTIC CURVE 


William A. Spurr and David R. Arnold 
Stanford University 

“Growth” curves have been used in many fields despite the 
extensive calculations required. These curves may be applied 
even more widely if their construction can be simplified. This 
article shows how to fit a logistic (Pearl-Reed) curve quite 
simply by means of a nomograph (which determines the upper 
limit from three selected points) and a logistic grid (which re¬ 
duces the curve itself to a straight line).^ Other “growth” 
curves may be fitted by constructing appropriate graphs. 

THE METHOD 

T he first step in fitting a logistic curve to a time series by any 
method is to determine whether the process represented has the char¬ 
acteristics of population gi-owth that will justify the use of the Pearl- 
Reed equation as a logical approximation.® The next step is to plot the 
complete series on semi-logarithmic graph paper to determine empiri¬ 
cally whether the rate of growth is steadily declining. Then three points, 
sometimes geometric averages of several years, may be selected, 
representing typical levels of the early, middle, and recent periods. The 
points must be equidistant in time. 

The nomograph (Figure 1) is then used to determine the upper limit 
fc of the logistic curve from three selected points Fo, Fi, and F 2 as 
follows: Compute Yi/Yq and F2/F1 by slide rule;* place a transparent 
ruler across these values on the left and center scales, and read off 
fc/Fo from the right hand scale. Multiply by Fo to find fc. 

Once the upper limit k is found, the Pearl-Reed curve may be deter¬ 
mined readily from the logistic grid (Figure 2) which reduces this curve 
to a straight line. Divide the tlirce selected points by k and plot these 
as percentages on the grid, using any convenient time scale on the 
X axis. Draw a straight line through the points^ and read the values of 
this line at intervals of time, including the future, if a forecast is de¬ 
sired. Multiply those values by fc, plot on the original somi-logarithmic 
chart and connect by a smooth curve. This is the logistic curve. 

1 The wiitera are indf^bted to Professors Albert Bowker and Alfred Niles of Stanford University 
for sugp:eBtion8 on this paper. 

* Soe R. Pearl and L. J Reed. The Biciogy Popvlation Qrovsth, Knopf, 1925. 

* Yt/Yi must be less than Y\/Y% for this type of curve. 

* If the three points do not fall on a straight line, an arithmetio error has been made. 
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These graphs may be used in any of four ways: 

(1) to obviate all mathematical calculations except for several 
slide-rule ratios; 

(2) to test any three selected points for goodness of fit. If the 
graphic lo^stic curve fits the data poorly, other combinations 
of three points may be tried. Then k may be computed mathe¬ 
matically for the points fiLaally chosen; 

(3) to serve as a quick check on the accuracy of machine calcula¬ 
tion, since any error is likely both to distort k considerably and 
to destroy the linearity of the three points on the lo^stic 
graph; 

(4) to make a further test of the validity of the logistic equation 
itself if desired. The raw data multiplied hyl/k must be roughly 
linear when plotted on the logistic grid to justify the use of 
this equation. 

The following procedure is recommended: Use the nomograph for 
purposes (2) and (3) above, but also compute 1/k mathematically, since 
the nomograph is fairly approximate. Then fit the actual curve by 
means of the logistic grid alone, since this is probably quite accurate 
enough. Elnlarge Figures 1 and 2 by photostat to a convenient size, and 
use tradng paper over the latter. 

niscussiON 

The logistic, Gompertz, and other S-shaped curves representing 
retardation of rate of growth have come into wide use since 1920 as 
means of measuring and projecting biological growth generally, and 
secular trends of population and industries in particular. The simple 
logistic or Pearl-Reed curve fitted to three selected points is probably 
the most widely used of these curves,® so it has been selected for solu¬ 
tion here. ^ 

The logistic curve may be written: Y = 

1 -|- 

where the curve ordinate Y approaches k the upper limit as time, X 
increases; e is the natural logarithm base 2.71828, and a and h are con¬ 
stants, h being negative. 

Even this simple form of the growth curve, however, requires the 
lengthy calculation of the three constants, k, a, and h, a procedure 
that must be repeated for each set of three points that are tested for 

s See Simon Kuenets, Seetdar Movemenia in Production and Priaa, pp. 59-68,197-199, Houghton 
MifBin, 1930, or a recent popular description in B. Dewey and E. F. Dakin, Cydea, Holt, 1947. 
C5h. I-IV. 
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goodness of fit?. Moreover, despite the precision of mathematical cfd- 
culation, the results are only approximate, since (1) the equation itself 
is merely one of several empirical approaches to the “law of growth”, 
(2) the selection of three points is subicctive, and (3) errors of any kind 
are exaggerated in extrapolating to find the upper limit. 

The graphic method obviates the following calculations; 

27oFiF* - ri*(y. + 70 
‘ Y,Y. - 

, k-7o 

= log, - 

7o 

1 , _ /'7,(fc - 7i)N 

k (This must be calculated for each 

7 -— point of the fitted curve.) 

1 + 

The errors of graphic measurement are believed to be small compared 
with the errors implicit in the Pearl-Reed equation itself and the 
selected points as described above. 


THS NOMOGBAPH 


The nomograph for finding the upper limit was constructed by 
reducing the logistic equation to: 

0 B - eA- 2AB + A*B +A = 0 

where =k/7o, = 7i/7o and B = 7*/7i. 

This equation was expressed as a third-order determinant which was 
multiplied by a matrix of transformation to produce a nomographic 
determinant.’' 

The equations for the coordinates of the nomographic scales (in 
arbitrary linear units) were then found to be: 


• Soo F. E. Croxton and D. J. Cowden, Applied Oeneral Sta/ieiica, pp. 452-45$, Prentice-Hall, 1939* 
The methods of least squares and moments are much more cumbersome than that desciibed 

The present method appeals to be shorter than that recently smycested by Dudley J. Cowden, 
(Journal of the American St<Ui*>ttad Aaeoeiatum, Dec. 1947, pp. 585-590), since: (1) the trial values of k 
may be determined from a nomograph, rather than computed; (2) the lofdstic grid straightens out the 
logistic curve itself, whereas Professor Cowden uses a semi-logarithmic grid to produce a linear trans- 


1 

formation of the function- 

Yc 


1 


K 


, which requires five extra columns of computationB or scale readings 


(p. 589). The method described here should also be shorter than that of Raymond Pearl (Introduction 
to Medical Biometry and Statistics, Third Edition, Chap. XVIII, W. B. Saunders Co., Philadelphia, 
1940) for similar reasons. 

V See F. T. Mavis, The Construction cf Nomographie Charts, pp. 56-57, Scranton, Fezmeylvania: 
International Textbook Co., 1939. 
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Z, => 0 

F. - 

4 

2.525 


5.765 - 9 

Xb -- 

Yb — 


0.765 - 4 


4 - 0.765 

2.52A 


7.76A - A* - 10 




0.76A - 4 


4 - 0.76A 


A nomograph of any size may be constructed from these equations 
by using an appropriate scale. However, it is simpler merely to enlarge 
Figure 1 by photostat or photograph, extending the h/Ya line upward 
if de^ed. 

The error in reading h from the nomograph is on the order of 2 per 
cent for the usual range of h values shown in Figure 1 (4<A:/yo<45) 
using a V by 10' chart, but is reduced for all points on the logistic 
curve since they are only fractions of Jt. These errors are fairly small 
compared with the hyper-sensitivity of h itself to the three subjectively 
chosen points.* The error increases for the extremely high and low 
values of fc which are not shown. These should be computed mathe¬ 
matically. 

The nomograph in general should be more generally used in sta¬ 
tistics, as it is in engmeering. It has these advantages: (1) the chart is 
simple and easy to read; (2) interpolation is made fairly accurately 
along a scale; and (3) the chart shows at any point the nature of the 
change in one variable due to changes in the other variables.* A curved 
nomograph, such as this one, saves even more labor than the more 
usual linear tsrpe,^* since it replaces a more complex equation ivith the 
attendant labor of calculation. 

In case it is desired to fit a logistic curve by using all available points 
rather than only three points, the following short-cut method is sug¬ 
gested for approximating the upper limit k. This method utilizes the 
principle that the per cent rate of increase in a logistic curve is a linear 


s For example, if the first two points are 1 and 2» a variation in the third point from 3 to 4 will 
cause k to vary from 4 to infinity. Estimates of the upper limit of California’s population (which is 
nearly an exponential curve) by 18 Stanford students l»8ed on various selections of three points from 
the same data varied from 18 to 613 millions. The variation is relatively small, though still considerable, 
in an older series. According to M. C. Rorty (/oumof o/fAe American Statistieal Aasociation, March, 
1931, pp. 8-9) the growth curve is significant when the series has reached within 20 to 30 per cent of the 
saturation point. 

9 Joseph Lipka, Graphical and Mechaniecd Computation, p. 44 f, 'Wiley, 1918. 
u See J. W. Dunlap and A. K. Kurtz, Handbook of SlatiaHeal Nomographs, TdNes, and Formtdas, 
iPart I. World Book Co., Yonkers, New York, 1932, for an excellent compilation of linear statistical 
nomographs. 
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function of the population itself, and that the rate approaches zero as 
the population approaches k. Compute by slide rule the percentage in¬ 
creases in the data from point to point. Then estimate the geometric 
mean of each pair of points from the middle of the line connecting the 
points on semi-logarithmic paper.” Now plot the percentage increases 
on the Y axis against the corresponding means on the X axis in a 
scatter diagram. Fit a straight line to these points by inspection or by 
least squares. (The goodness of fit of this line is a test of the validity of 
the lo^stic.) This line extended will cross the X axis at the desired 
value of k. However, as Yule said,” “I do not think this is a very good 
method of fitting.” The graphic form of the method described above is 
shorter than Yule’s or Hotelling’s versions and is about as accurate as 
Yule’s; but it still involves several approximations, and is much more 
work than the use of the nomograph with three points only. 

THB LOGISTIC GBXD 

The lo^tic grid not only provides a linear transformation of the 
logistic curve, but may also be used to find the remaining constants in 
its equation, if desired, as follows: 



XtO% ~ Xit.9t% 

o = — bXt<i% 


where the X values are the abscissas corresponding to the Y/k values 
of 50 per cent and 26.89 per cent respectively.” 

Any lo^stic scale may be computed as follows: For all desired scale 
values of Y/k up to 50%, list the corresponding values of log, {k/Y— 1) 


u More precise but laborious methods of computing these values have been suggested by Yule 
(Journal of the Royal Statistical Society, 1925, pp. 1-58) and Hotelling (This Journal, September, 
1927, pp. 283<^14). Hotelling shovrs an illustrative scatter diagram of this general method on page 299. 
Op. c»£., Method 3. p. 62. 

^ Proof: 1. liet Y/k »60% in the logistic equation. Then a » — 


2. Let Y/k ap. Then a 

b 






/i-A 

aloge I -j- Substituting 


a above. 


1. Then p *-a26.89% 

1+s 




1 
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FIGURE 2 

from tables of reciprocals and natural logarithms (such as Barlow’s 
TahUs and the Federal Works Agency, Tables of Natural Logarithms). 
Multiply the results by a constant to obtain the desired scale. Then 
measure these figures as linear distances down from the center of the 
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chart for Y/k values below 60 per cent and up for complementary 
values above 50 per cent. For example, for F/fc = .01 per cent (or 99.99 
per cent) 1=9999, its log« is 9.210, so this line would be scaled 

9.210 inches or a multiple thereof below (or above) the middle of the 
chart.^^ 

Again, however, a photostatic enlargement of Figure 2 is a simpler 
method of constructing this scale. 

The accuracy of reading the logistic graph varies with the scale and 
with the vertical position of points on the chart. For a T' by 10*" chart, 
the average error will vary from about .01 of 1 per cent at the top and 
bottom to about 0.5 of 1 per cent at the center. The error will be pro¬ 
portionally less for larger charts. Again, these errors seem small com¬ 
pared with the approximations involved in the selection of three points 
and the equation itself. The logistic scale, unlike the nomograph, is valid 
for all values of /b. 

The logistic giid superficially resembles the probability grid which 
provides a linear transformation of the cumulative normal probability 
curve, derived from the law of errors rather than the “law of growth.” 
The use of the logistic scale will avoid the sizeable error involved in the 
use of the probability scale as a means of approximating the Pearl- 
Eeed curve.^® 


« Linear ordinates (t) for integral percentages of T/k (y') from 1 per cent to 99 per cent are listed 
by Yule. op. off., p. 48. 

“ For example, Croxton and Cowden, op. cif., pp. 458-461, reach an upper limit of 400 millions 
for U. S. population from probability paper, compared with 190.8 millions from the Pearl-Keed equation 
and their consensus of "well under 200 millions” from other sources. The logistic function is claimed 
to be superior to the probability function as a measure of growth by C. P. Winsor {Journal of the 
Waslunffton Academy of Sciences^ February 19, 1932, pp. 73-84) and by J. Berkson {Journal of the 
American Statistical Asi^ociaiwn^ September, 1944, p. 357 f). It Ls criticised by A. L. Bowley {Journal 
of the Royal StaiiMical Society^ 1925, pp. 76-81), G. H. Davies {Journal of the American StatuHcal 
Asaoeiaiion, 1927, pp. 370-374), and Q. U. Knibbs {Journal of the American Statistical Association^ 
1926, p. 381 f and 1927, p. 49 f). 
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Rutgers University 

Bevezet^s a Statisztika Tudomdnyllba: B.§sz L (Introduction to the Science of 
Statistics, Part 1.) Gyrda Abay. P4cs, Hungary: Rdkdczi Nyomda (Rdkdczi-Ct 
35). 1945. Pp. ii, 98. 

Statis 2 dika Mddszertani Alapvetes. (Elements of Statistical Methodology.) 
Lordnd Schweng. Budapest. Hungary: Stephaneum Nyomda, 1944. Pp. xviii, 
366. 


Review by Eugene Lukacs 
Professor of Mathematics, Our Lady of Cincinnati College 
Cincinnali, Ohio 

A bates textbook, intended for students of law and social sciences, is di- 
L vided into five chapters. Chapter 1 surveys the various fields of applied 
statistics and classifies social statistics into statistics involving population 
problems, economic statistics, and statistics related to public health and 
education. Chapter 2 discusses the history of statistics with particular refer¬ 
ence to Hungarian statistics. Chapter 3 treats the gathering of statistical 
data and the work of the oflicial Hungarian statistical services. Chapter 4 dis¬ 
cusses the organization of the data and the construction of empirical fre¬ 
quency distributions; numerous examples and graphs are given. Chapter 5 
deals with measures of central tendency and dispersion. 

Although Abay states that it is impossible to discuss modern statistics 
without using mathematical methods, he avoids the use of mathematical 
formulae. There are a few incorrect statements: for example, on pages 9 and 
10, the author states that every frequency curve would be a normal curve if 
one only had enough data. Statistical methods developed within the past 
forty years are not considered. The book is not likely to lead the student to a 
proper appreciation of statistical methods. 

Schweng's text—prepared for students of social sciences, especially eco¬ 
nomics—discusses in great detail methods of collecting and organizing sta¬ 
tistical data. The first part of the book deals with the collection of data and 
the nature of statistical tables. The proper consideration of significant fig¬ 
ures is greatly emphasized. The second part discusses graphs, diagrams, 
ratios, and percentages. The third part presents an elementary discussion of 
frequency distributions and measures of central tendency and dispersion. 
Very few formulae are used, and no proofs are given. The normal distribu¬ 
tion is barely mentioned. Parts 4 and 5 give brief introductions to time series 
and index numbers. Part 6 surveys the most important Hungarian index 
numbers. 

Numerous examples are ^ven throughout the text. Statements and defi- 

135 



136 


AMERICAN STATISTICAL ASSOCIATION 


nitions are always illustrated by tables and graphs. The 5-page bibliog¬ 
raphy lists a large number of American, British, French, German, and 
Italian books. The book does not reflect the progress of theoretical statistics 
during the last forty years and therefore can not be considered an introduc¬ 
tion to modem statistical methods. Nevertheless, the book may prove use¬ 
ful for the consumer of Hungarian statistics rather than for the producer of 
statistical data. 

College Algebra. A. Adrian Albert (Professor of'Mathematics, University of 
Chicago). New York 18: McGraw-Hill Book Co., Inc. (330 West 42nd St.), 
1946. Pp. xii, 278. $2.75. (London W.C. 2: McGraw-Hill Publishing Co. Lti 
[Aldwych House, Aldwych]. 14s.) 

Review by Paul S. Dwyer 
Professor of Mathematics 
University of Michigan, Ann Arbor, Michigan 

T his book incorporates a drastic revision of the conventional approach 
and treatment of the subject matter of college algebra and as such 
deserves the attention of statisticians. Important objectives of the author 
are: 

a) The material should be presented as a unified and compact body of 
mathematical theory. As worked out by the author, this unity is achieved 
by the successive study of such general topics as the number systems of ele¬ 
mentary mathematics, polynomials and allied functions, algebraic identities, 
equations and systems of equations. Considerable material not usually pre¬ 
sented in a course in college algebra is included in this approach. 

b) The definitions and theorems should be stated accurately. This means, 
of course, that the author has been quite formal in the presentation of the 
material. This formality will probably be approved by mathematical statis¬ 
ticians. It may not be so welcome to other statisticians who wish to study the 
main results with as little of the vocabulary and notation of the mathe¬ 
matician as is possible. 

c) Proofs of results should be given only when it is probable that the better 
students will understand them and when their inclusion will add to the 
understanding of the results. The author is very careful to indicate each 
statement which is lacking in proof. In this connection, the final chapter 
(Chap. 10) on matrices and quadratic forms should be noted since the 
author makes no pretense at proofs here. This chapter, frankly experi¬ 
mental, is for the purpose of providing matrix material for those who un¬ 
derstand the definitions, notation and statements of theorems. As such it 
will be of interest to many statisticians. 

d) There should be an adequate number of illustrative examples and 
exercises. The author has provided these and it seems that he has provided 
another feature which will be of interest to applied statisticians—a greater 
emphads upon numerical work than is found in most texts on college algebra. 
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e) The text should abound with additional material on the same subject 
for the better student. The author has accomplished this with the use of ad¬ 
ditional sections and chapters which are indicated as “full course” and which 
tend to amplify the minimum treatment. In some of this optional material 
the author attempts to relate the subject matter of college algebra to other 
mateiial in college mathematics. Chapter 8, “Vectors in the Plane” is of this 
sort. 

The statistician will be particularly interested in the treatment of Chapter 
9, “Matrices, Determinants and linear Systems,” and Chapter 10, referred 
to above. He may or may not approve of the introduction of the concept of a 
matrix before that of a determinant, but he certainly will approve of the use 
of elimination methods, rather than determinants, in solving linear simul¬ 
taneous equations. It is unfortunate that the author does not also show that 
elimination (condensation) methods can be used in evaluating determinants 
and, in fact, that the same general procedures can be employed in the for¬ 
ward solution no matter whether one solves equations, evaluates determi¬ 
nants, or computes the inverse of a matrix. The author does give a general 
treatment of determinants of any order. 

Chapter 10 has been described above. It ^ves a number of basic theorems 
and techniques which are commonly useful, together with simple problems. 
It is unfortunate that the definition of matrix multiplication is given only 
in terms of the formal row-by-column multiplication of A and B and is not 
extended to the frequently useful column-by-column multiplication of A' 
and B or the row-by-row multiplication of A and B\ 

The first copies of the book contained an unusually large number of mis¬ 
takes not only in the answers to the problems but in the theory itself. Most 
of these have been removed in the second printing. 

The author is to be commended for the way in which he has accomplished 
his objectives. Each reader must determine for himself whether these are 
appropriate objectives for him. I think that we as statisticians will agree 
that the emphasis upon numerical work, the more practical approach to the 
problem of simultaneous linear equations, and the inclusion of matrix ma¬ 
terial arc all steps in the right direction. 


A Concise Manual of Statistics: With Special Reference to the Reqtiirements of 
Students for Municipal Examination, Second Edition. Clement Burton (Borough 
Trcasuier’s Department, Hackney Town Hall, London E. 8, England). London 
E.C. 2: Gee and Company (Publishers) Limited (27-28_Basinghall St.}, 1946. 
Pp. 182. 16s. 


Review bt Harry Pellb Hartkbmeibr 
Professor of Business Statistics, University of Missouri 
Columbia, Missouri 


T he preface of this book contains the statement: “The increasing im¬ 
portance of the subject of statistics in commercial and municipal activity 
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has prompted the preparation of this handbook in the hope that it will prove 
of interest generally, and will be of assistance to those who are studying for 
the examinations of the various professional bodies of accountants, and more 
particularly for those of the Institute of Municipal Treasurers and Account¬ 
ants ('Incorporated).” In England it is necessary for a person to pass an 
examination in the field of statistics before he is allowed to practice with 
something equivalent to our C.P.A. This is an excellent practice and the 
authonties responsible are to be congratulated. It would be a very good idea 
if accountants in the United States were required to demonstrate a similar 
knowledge of statistics before they are recognized as Certified Public Ac¬ 
countants. Most accountants are deficient in a knowledge of sampling tech¬ 
niques and tests of significance or reliability. 

This book is an excellent one in view of its purpose as a manual or hand¬ 
book. It will serve an accountant well when he wants to refresh his memory 
about a statistical formula or process. Presumably, the accountant has al¬ 
ready known something about statistics and needs primarily a rapid review. 
This book is not so well suited to be an elementary textbook for a new stu¬ 
dent to use to obtain an introduction to statistics. One book could hardly be 
both an elementary textbook and a concise manual or handbook. 

The book seems a bit jerky at some points. One gets the impression that 
it was written a little longer and parts were deleted to shorten the book. 
This could have been the work of some editor not familiar with the material 
or subject. At most places the material reads smoothly and for an American 
reader the book has some delightfully surprising expressions. For example, 
rounded off data are items ‘‘shorn of digits.” 

Everyone has the right to select his own symbols, but the American reader 
is not apt to like the choice of a for the arithmetic average, Z for the mode, 
and M for the median. The usual meaning for the probable error is found on 
pages 123 and 124 in connection with the explanation of correlation, but there 
may be some confusion when the reader compares tliis meaning with the 
statement on page 29, “When figures are taken, for example, to the nearest 
100, the maximum possible error is 49; but as the error may be anywhere 
between 0 and 49 the average or probable error is 24.5.” 

On page 124 the reader encounters eight statements dealing with the 
interpretation of the value of r. Two of them are: “If r is less than .3—^little 
evidence of correlation.” “If r is more than 3 times the probable error—evi¬ 
dence of correlation.” No mention is made of the possibility that r could 
easily be 3 times the probable error and be less than .3. The two different 
conclusions conflict. What could be said about such a situation is that when 
r is more than 3 times the probable error there is reliable evidence that some 
correlation exists but when r is also less than .3 the correlation is not large 
enough to be of practical value for estimating the size of one variable from a 
knowledge of the other variable and the equation of relationship. 

Very careful type setting or very good proofreading was responsible for 
the fact that very few errors are noticed. 
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One of the principal virtues of the book is the honest recognition of the 
value of the work of the statistician. “The position of the statistician lies 
between the accountant or cost accountant and the economist or executive 
authority.” No doubt this will sound like heresy to accountants in the United 
States, who for years have claimed the position of being the right-hand man 
of the executive. At long last, people are beginning to realize that accounting 
is but a pait of the much broader science of business statistics. 


Mathematische Erblichkeitsanalyse von Populationen. Gunnar DaKlberg (Pro¬ 
fessor and Head of the Swedish State Institute for Human Genetics and Hace 
Biology, Stockholm, Sweden). Aus Dem Staatlichen Institut fur Menschliche 
Erblichkeitsforschung und Rasenbiologie zu Uppsala. Diese Arbeit erscheint 
auch alb Supplcmentum CXLVlll von Acta Medica Scandinavica. Uppsala, 
Sweden: Almqvist & Wiksells, 1943. Fp. 219. Paper. 

Review by Oystein Ore 
Professor of Mathematics, Yale University 
New Haven, Connecticut 

I N THIS work the author gives a review of the applications of mathematical 
methods to certain problems of inheritance. Particular emphasis is placed 
on the consequences for human society of some of the mathematical theories 
and hypotheses and almost all illustrations are drawn from this sphere. A basic 
knowledge of genetical terminology is presupposed. The introductory sec¬ 
tions are devoted to a discussion of the use of the elements of probability to 
the determination of the various gene combinations and their frequencies 
as they may occur in a population. In the later sections considerations and 
methods of an actuarial nature predominate. Here one finds an extensive 
analysis of the effect which certain conditions in the population, for in¬ 
stance occurrence of mutations, existence of isolated groups and selection 
of various kinds, e.g, mating with closely related individuals, will eventually 
have on the gene frequencies in a society. Of particular value are the numer¬ 
ous tabular computations which illustrate the consequences of the various 
numerical assumptions one may make in regard to the perturbing factors. 


Cycles: The Science of Prediction. Edward R. Dewey (Director, Foundation for 
the Study of Cycles, 274 Madison Ave., New York, N. Y.) and Edwin F. Dakin 
(Counsel in Public Rehitions, Plymouth, N. H.). New York 10: Henry Holt & 
Co. (257 Fourth Ave.), 1947. Pp. xi, 255. $3.00. Two reviews follow. 

Review by Milton Friedman 
Associate Professor of Economics, The University of Chicago 
Chicago, Illinois 

T here are, say Dewey and Dakin, a number of important rhythms or 
cycles in economic activity. These cycles have fixed periods and are com¬ 
mon to many different activities. Any particular economic series is to be 
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conceived as the sum of such cycles superimposed on a trend and dis¬ 
turbed—^though only temporarily—^by such erratic and random factors as 
wars. The appropriate technique for predicting the future course of any 
economic series is to determine empirically from its past behavior its trend 
and the chief cycles it contains, to synthesize these, and extrapolate them into 
the future. Dewey and Dakin conclude from analysis of this type so far per¬ 
formed that the four most important of the many rhythms have been isolated 
in our economy are: 

A. The 54-year rhythm in wholesale prices and industrial innovations. 

B. The 9-year rhythm in wholesale prices, security prices, pig iron pro¬ 
duction, and industrial actirity. 

C. The 3i-year rhythm in security prices and in business activity, whole¬ 
sale and retail. 

D. The 18-year rh 3 rthm in real estate activity, and in related industrial 
enterprise (p. 135). 

These rhythms operate in an economy that is growing at a declining rate and 
is approaching “maturity,” so that their downward phases will appear rela¬ 
tively more severe, and their upward phases less buoyant, in the future than 
in the past. The 64-year rhythm turned down in 1925, and “is due to reach 
bottom in 1952” (p. 226); the 9-year rhythm was due for a peak in 1946 and 
a subsequent decline until 1951; the 3i-year rhythm was due for a peak in 
1947 and again in 1950 and for lows in 1948 and 1951; the 18-year rhythm in 
real estate activity “apparently reached its high about late 1942” and is 
“due to decline to a low around 1953” (p. 227). “The foregoing is not meant 
to suggest that our rhythms indicate collapse of our economy. We may only 
infer, if they continue, that around 1947 a fall-o£f will start, and our economy 
will experience rather protracted declines in prices, production, employment, 
and economic activity generally, reaching a bottom that would presumably 
be dated sometime in the early fifties” (p. 230). 

The book presenting this message is difSicult to classify. It is not a scientific 
book: the evidence underlying the stated conclusions is not presented in full; 
data graphed are not identified so that someone else could reproduce them; 
the techniques employed are nowhere described in detail. Yet it is not a popu¬ 
lar book: there are too many graphs and too much technical jargon to call it 
that. Its closest analogue is the modern high-power advertisement—^here of 
book length and designed to sell an esoteric and supposedly scientific product. 
Like most modern advertising, the book seeks to sell its product by making 
exaggerated claims for it (that it will give suflBciently accurate forecasts to 
contribute substantially to success in business), showing it in association 
with other valued objects which really have nothing to do with it (established 
cycles in physical, meteorological, or other natural phenomena, and the 
successful use of cycles in predicting such phenomena as tides), keeping 
discreetly silent about its defects or mentioning them in only the vaguest 
form (no examples of past failures of cyclical patterns to reproduce them¬ 
selves are mentioned), and citing authorities who think highly of the 
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product. As with most modern advertising, the reader will find the book easy 
and interesting to read; when he finishes, he will find himself favorably in¬ 
clined toward the product; yet, when he reflects, he will find that there is 
little reason why he should be. What is good is commonplace, and what is 
claimed as exceptional is unproven. There is little doubt that approximate 
rhythms exist in economic activities, that there is a building cycle in the 
United States about 18 years in length, and a business cycle about years 
in length. There is also little doubt—^though the authors do not mention this 
—^that every attempt to use these facts or any others to forecast economic 
activity has to date met with failure. Beyond this, we know little. The 
authors assert a good deal more, but these assertions, though illustrated by 
selective examples, are not demonstrated by any comprehensive or carefully 
presented body of evidence. Even the techniques used to derive the particular 
conclusions about lengths of cycles and the like are only alluded to in pasdng 
in a chapter (Chapter 11, “Analysis and Synthesis”) which is clearly in¬ 
tended to impress the reader with the scientific magic used to get the stated 
results rather than to initiate him into the art or to enable him to test the 
results obtained by the authors. 

An excellent, though somewhat extreme, example of the level of analysis 
in the book is its treatment of secular trends. The book makes much of the 
universality of retardation in the rate of growth of individual industries. 
From this well established phenomenon, it erroneously deduces that the 
economy at large must grow at a retarded rate and approach maturity. And 
it makes this erroneous deduction despite a reference (in which, incidentally, 
the title is given incorrectly) to A. F. Burns' Production Trends in the 
United States Since as “required reading” (p. 62). Yet one of the most 
important conclusions of Burns' book is that retardation in the rate of growth 
of the economy as a whole is not a necessary result of retardation in the rate 
of growth of each industry separately; and, further, that there is little evi¬ 
dence in American data of the existence of retardation for the economy as a 
whole. 


Review by Max Sastjlt 
Research Associate, The Robinson Foundation 
Washington, D. C. 

T his is an important, certainly a significant book. We may discount the 
jacket statements that it 

establishes the ground for a completely new approach to the economic prob¬ 
lems of our era. It puts a tool in the hands of businessmen, statesmen, and 
serious readers which will enable them to predict the economic outlook for 
the critical years ahead. 

This is over-optimistic, even for a jacket blurb. But the book is significant 
as a contemporary sociological document, emphasizing the difGiculties of de¬ 
veloping a realistic economics in the service of society. 
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The authors, according to the jacket, are experienced in counseling 
“corporate enterprise.” The book is in fact a model of skillful writing. A good 
deal of information is given about “cycles,” but not more than confused 
intimations about the science of prediction. The content of the book boils 
down to the quest for a science of prediction—^by business counselors. The 
title and the style are, however, very effective. Backed by skillful publicity 
in the preceding years, the book has been a best-seller for months! The ob¬ 
jective of the reported research is not abstract theory; it is the age-old quest 
of “Forecasting for Profit.” The book may contribute to this aim some¬ 
what, indirectly. Indirectly it may contribute to the progress in cycle re¬ 
search by high-lighting the gulf between the demand for realistic economic 
analysis and the generally barren theory and disputation available as supply. 

The book appears to be a progress report of “a non-profit corporation” 
organized as the Foundation for the Study of Cycles, of which Mr. Dewey is 
director. The early chapters treat of trends and patterns in typical industries. 
Well-known incidents of recent decades in commerce and industry are pre¬ 
sented in the jargon of the financial and commodity marts with disarming 
easy simplicity. The next few chapters deal with rhythmic cycles. Certain 
cycles of nature, some more or less recently noticed, are first discussed. 
Then a chapter each is given to certain marked economic cycles that have 
become familiar in recent literature—^the cycles of 54, 9, 3|, and 18 years. 
The final chapters treat of purported causal and correlation analyses of eco¬ 
nomic series; the effects of the War; and probable postwar trends and 
rhythms. In three short appendices are presented—as tools for the reader (?) 
—^the rudiments of the “ratio” scale, moving averages, and the “section 
moving average.” No index is supplied. 

Very effective advance publicity was given the book in a popular article^ 
in 1943. This led Smith and Duncan* to the belief—^in 1944—that: 

So general was the interest in cyclical behavior [after the 1930’s] that 
by 1940 the Foundation for the Study of Cycles ... proposed to help in 
the task of integrating the work of the thousands of scientists and statis¬ 
ticians who are contributing in various fields to the study of cycles. Not 
only have cycles been found to exist in the realm of business activity, but 
scientists in many other fields believe they have discovered cyclical be¬ 
havior in their respective studies. 

Dewey stated, in his 1943 article, “the results are most encouraging. Fore¬ 
casts made by projecting well-established cycles are coming true every 
day.” He predicted: 

Figure on the possibility of a major financial and economic setback start¬ 
ing about 1947.. .. Heavy industry will begin to slack off next year [1944] 
if the 41-montb rhythm of peaks and valleys prevails over this war as it did 
during the last one. 

In the present book, in 1947, Dewey and Dakin purport to do more than 

1 Dewey, Edw«rd B. ‘Sdenoe Predicts the Fatxire.” Am Mag 136:90-2 S '43. 

> Smith, James Q., and Duncan, Acheson J. ElemerUary StatisUca and Ap%AicaHona: Vol. J, Fundet^ 
mantah <tf the Theory of StoHakat p. 652. New York: McGiaw-HiU Book Go., Inc., 1944. 
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"integrate” the work of numerous others. They believe that [like nuclear 
physics] 

economics is now reaching a point where it can hope also to make rather 
accurate piodictions, within limits which this study will explain (p. vii). 

The reader will be introduced to a method of thinking about the future 
which—new though it may be to him—seems definitely to have proved of 
value. It is this method which is of fundamental importance—^an im¬ 
portance greater than any specific conclusions to which it may lead. For on 
its validity depends the whole value of the conclusions (p. viii). 

The authors then record their deep indebtedness 

to those whose names, equations and graphs line the pages of this book— 
and to many others unnamed—.... Theirs is the pioneering that is moving 
economics out of the blind alley where it stood for many years, so that it can 
take its rank as a true science (p. viii). 

The substance of the book is in fact largely derived from available fan iliar 
studies on cycles: Persons, Mitchell, Schumpeter, and especially W. I. King 
on economic cycles; Harlan Stetson, and others on sunspot and other cycles 
in nature. The authors, however, read much more of an inexorable periodic¬ 
ity into the work of these well-known scholars than is warranted. The dis¬ 
turbing implication of this determinism is explained away (p. 98) by reason¬ 
ing that so far has proved inscrutable to the reviewer. Indeed, elsewhere 
the authors aver blithely that social forces—^at least quasi-volitional—can 
thwart, or reinforce the normal pattern of the unfolding cycle (p. 113)—^this 
with reference to comments on the "inflationary forces” influencing us at 
present (1947). Some lingering doubts as to the consistency of their position 
are disposed of by a singular argument: one must not "reason solely on the 
ordinary causc-and-eflect basis” in business, or in general economic analyses, 
(p. 114). The authors purport to derive support for this dialectic handspring 
from some of Eddington’s lay reflections (in his Gifford lectures, 1925), and 
from nebulous references to Ouspensky, Spengler and Jeans. 

The principal defect of the authors’ discussion is a misreading of their 
primary sources, and misinterpretation of others. They also omit reference 
to outstanding works that quite invalidate their position. Thus, Harlan 
Stetson, a veteran physicist astronomer, discusses sunspots and their influ¬ 
ence in a number of studies extending over twenty years, and clearly shows* 
that the 11-year cycle, so striking in appearance on the graph, varies widely 
in length—10 to 10 years—and has no more scientific validity than the 
periods of 23, 37, 77, 83, and more years found by other competent in¬ 
vestigators. Mitchell and his coworkers have studied hundreds of economic 
series during the past three decades. In the last annual report^ of the Na¬ 
tional Bureau of Economic Research, Burns demonstrates again that recur¬ 
rences may always be expected, but, so far, they cannot be predicted. The 

* Stetson, Harlan. Sunspots and Their EffedU, pp. 162-72. New York: McGiaw-EGill Book Go., 
1930. 

« Bums, Arthur F. Stepping Stones Toward the Future. Twentj-Seventh Annual Beport of the 
National Bureau of Economic Research. New York: the Bureau, March, 1947. 
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repeated failures during the past century of hypotheses suggesting periods 
from forty months to sixty years in economic series are stressed anew by 
Burns (pp. 3-5). These cannot be blandly explained away by the Dewey- 
Dakin Chapter 13, on “Avoiding Some Economic Illusions.” It is impres¬ 
sive, but neither relevant nor conclusive, when they quote in their argument 
this sententious aphorism from Spengler: 

The separate sciences—epistemology, physics, chemistry, mathematics, 
astronomy—are approaching one another with acceleration, converging 
toward a complete identity of results. 

Their case would not be bettered if, as the authors state, “he could well have 
added biology, psychology, economics, and sociology to his list” (p. 198). 

Prom the technical statistical analysis side, the thesis of the authors is 
thoroughly repudiated in the extended recent study (not noted by them) by 
M. G. Kendall.® Kendall’s authoritative analysis is supported by the panel 
of eminent English statisticians discussing his paper. 

The discussion in the book ignores the well-considered, authoritative 
articles in the Encyclopedia of the Social Sciences (“Business Cycles” by 
Wesley C. Mitchell and “Forecasting Business” by Garfield V. Cox) and 
in the Encyclopedia Brilannica, Fourteenth Edition (“Trade Cycle” by 
D. H. Robertson and “Trade Forecasts” by W. I. King). These authors 
rejected without qualification the idea that economic cycles can be predicted. 
Robertson, noting that for England crises occurred m 1835,1847,1866,1873, 
1882,1890,1900,1907,1913 and 1920, stressed that it is unsafe to dogmatize 
about the length or degree of phase synchronism in the leading industries or 
in leading industrial countries—^for any^ cycle phase. W. I. Bang, on whose 
The Causes of Economic Fluctuations (1938) Dewey and Dakin lean heavily, 
noted clearly with regard to patterns that frequently repeat themselves, that 
the difficulty is “that in many cases expected repetition fails to materialize.” 
The authors also make extensive use of Edgar Lawrence Smith’s study. 
Tides in the Affairs of Men (1939). They give no inkling that he subtitled his 
book as merely “An Approach to the Appraisal of Economic Change.” Fi¬ 
nally, it is singular that the authors make no reference to Burns and Mitchell’s 
significant book, Measuring Business Cycles,^ which was available early in 
1946. Preliminary versions of its content have been available for over ten 
years, e.g., Arthur P. Burns, in a 1936 paper,^ reported that Mitchell’s 
“forthcoming second volume on business cycles” exhibits the markedly 
wide range of variation in length of various economic cycles. 

The wide variations in cycle length is the nub of the difficulty in the quest 
for means of predicting cycle phases. This difficulty has completely escaped 
the authors’ notice. They draw heavily on Schumpeter’s notable Business 

■ Eendall, M. G. “On the Analysis of Oscillatory Time-Series.* J Royal Stat 8oe 108:93-141 *45. 

■ Bums. Arthur F, and Mitchell. Wesley C. Measuring Business Cydes. New York: National 
Bureau of Economic Research, 1946. 

* Bums, Arthur F. “The Brookings Inquiry Into Income Distribution and Progress.* p. 513. 
Q J Boon 50:476-523 My *46. 
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Cycles.^ Schumpeter gives considerable attention to the variability of cycle 
phases; his last chapter is “The Disappointing Juglar” (the 9-year cycle). 
Schumpeter's notable idea of innovations —^which the authors confuse badly 
with Slutsky's “random shocks” (p. 145)—^is of crucial significance here. The 
authors would find some invaluable suggestions for a pragmatic interpreta¬ 
tion of the dynamics of cycles—far beyond the ken of the linguistic labors of 
Spengler and Sorokin—^in Schumpeter's idea of the role of innovations in 
generating cycles.® If the authors would recognize that innovations are like 
“driving forces” in vibrating systems, or like perturbing forces in periodic 
orbits, they could serve more effectively the objectives presumably con¬ 
templated by the Foundation for the Study of Cycles. 

The entire technical apparatus used in this ambitious study is apparently 
revealed in a footnote on page 174. It seems to comprise merely graphical 
juggling of deviations from rudimentary moving average “trends”—direct, 
and rate-of-change. There are ghostly echoes of the “eliminated,” “crossed,” 
and “lagged” trends of the “Quadrature Theory” hoax of the middle 1920's 
(this Journal, March 1924). Naturally the clues to the mechanics of cycle 
generation have eluded the authors. Their avid readers might, perhaps, find 
some light in their quest for a science of prediction in the better treatises on 
vibrating systems, e.g., those of Crandall (1926), or Den Hartog (1940). 
Regrettably, these readers do not appear to have found such light in availa¬ 
ble recent studies on business cycles. 


Probit Analysis: A Statistical Treatment of the Sigmoid Response Curve. D. /. 
Finney (Lecturer in the Design and Analysis of Scientific Experiment, University 
of Oxford.) Foreword by F. Tattersfield (Head of the Department of Insecticides 
and Fungicides, Rothamsted Expeiimental Station, Harpenden, Herts, England). 
London N.W. 1: Cambridge University Press (Bentley House, 200 Euston Road), 
1947- Pp. xiii, 256. 18s. (New York 11: Macmillan Co. [60 Fifth Ave.]. $3.75.) 
Two reviews follow. 


Review by Margaret Merrell 
Associate Professor of Biostatistics 

School of Hygiene and Public Healthy The Johns Hopkins University 
Baltimore^ Maryland 

T he word order of the title of this book rather carries the implication that 
probit analysis is a method tied to problems of biological response. Al¬ 
though the title may not be important, it is mentioned here because it re¬ 
flects the presentation of the material. The method of analysis is almost 
complctel 3 ^ identified with the bioassay problem to which it is applied, and 
vice versa. Thus the book is at the same time a textbook on the problems 

I Schumpeter, J A Bwsineta Cydett V6U i ani S. New York: McGraw-Hill Book Co., 1939. 

* Cf Goodwin, Richard. *Ixmoyationa and Irregularity of Economic Cycks.” 22 JEcon SUA 28: 
95-104 My ’45. 



146 


AMERICAN STATISTICAL ASSOCIATION 


involved in assaying products through biological responses, and a handbook 
on the solution to these problems by means of probit analysis. In order to 
appraise the book, these two aspects will be considered separately. 

The book opens with a definition and discussion of the term biological as¬ 
say, as "the measurement of the potency of any stimulus ... by means of 
the reactions which it produces in living matter” (p. 1). The type of assay 
most fully treated in the book is that of the all-or-nothing response, such as 
lived or died. With graded doses of the stimulus, changing proportions of the 
test animals will respond and a sigmoid response curve usuaUy results. Around 
this curve, numerous biological and analytical questions arise, concerned 
with both the variation in the product being tested and the variation in the 
biological form used for test. Finney gives an excellent discussion of many of 
these problems. Some of these will be mentioned to give some impression of 
the scope of the book: the determination of relative potency of two products, 
and a consideration of situations in which the term “relative potency” has 
no real meaning; adjustment for natural mortality which occurs independ¬ 
ently of the product tested; the factorial design of experiments to avoid the 
weakness of considering one at a time, a series of factors which may not 
operate independently; the effect of mixtures of poisons which may be inde¬ 
pendent and different in their action, similar in their action, or sjmergistic; 
the case of a graded rather than an all-or-nothing response. This is a con¬ 
siderably more comprehensive discussion of the probleu's than is available 
in other books on bioassay, and will serve a very useful purpose in bringing 
together material that is scattered rather widely through the literature. 

The solution to these problems is oriented to the fitting of the cumulative 
normal curve by means of the probit transformation. The term “probit” 
(contraction of “probability unit”) was introduced by C. I. Bliss in 1934, 
for the deviate in a normal distribution with mean 5 and variance unity. His 
reason for assigning 5 rather than 0 to the mean was a desire to avoid nega¬ 
tive numbers. 

Finney points out in an historical sketch (pp. 41-47) tliat the method is 
much older than the name. He traces it back to 1860 and points out that it 
has since then been developed anew around problems in a number of differ¬ 
ent fields. His presentation of the method is, however, so interwoven with 
the problem to which he applies it that it may be well to state the funda¬ 
mental steps independent of that problem. These are: (o) determining 
whether a normal curve is suitable for describing the data, and if not, trans¬ 
forming the scale of measurement so that the distribution is normal; (h) sub¬ 
stituting for the accumulated proportions the corresponding probits, which 
will then be linearly related to the variate; (c) determining the parameters 
of the straight line relating the probits to the variate; and (d) performing 
goodness of fit tests, and setting up confidence limits about the parameters 
and the line. 

In recent years, J. H. Gaddum, C. I. Bliss, R, A. Fisher, D. J. Finney, and 
others have contributed extensively to refinements in the methods of estima- 
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tion, determination of the sampling variations of the constants, and plans of 
experiments related to this method of analysis. 

The book is designed to explain these developments to people dealing 
with the problem of bioassay. It “is written with the intention of introducing 
the probit method to many who have previously not ventured to use it, and 
of presenting some of its more recent developments to those who are already 
familiar with it” (p. 6). It assumes some but not an extensive knowledge of 
statistics, and is directed primarily to the applied person. The procedures 
are explained around arithmetic examples which will be especially useful in 
acquainting the biologist with the methods. Appendices present a systematic 
treatment of one example, with all steps clearly described, a mathematical 
outline of the theory underlying the method, and a series of very useful tables 
which facilitate the computation. 

Probit analysis is first presented in terms of a graphical method of de¬ 
termining the parameters of the straight line, and it is stated that this 
method is often adequate and serves especially the person who has a long 
series of estimations to make. The statement (p. 24) that “many experi¬ 
menters who make use of probit analysis spend time unnecessarily on arith¬ 
metic when eye estimation would suffice” is unquestionably true, and it is 
gratifying to have this point emphasized. 

The more exact maximum likelihood estimates obtained by a series of 
successive approximations are presented in Chapter 4. The reasons stated 
for using more exact methods are that the observations may be too irregular 
for any confidence to be placed in an eye estimation of the fitted line, or 
differences in weights of the observations may be hard to allow for, or the 
experiment may involve several different factors which are under test which 
call for more involved analytical techniques. The last reason is a sound one 
for using the more involved techniques in certain problems, and the second 
may occasionally be a justification, but the first is dubious. In general it is 
the very good observations, not the crude ones, that justify elaboration of 
the analysis. The fact is, that for an enormous number of problems in bio- 
assay, a graphical fitting of the straight line gives adequate estimates; for 
if the observations arc very smooth, the graphical estimates agree well 
with the maximum likelihood estimates, and if the observations arc irregular, 
the sampling errors arc so great that graphical estimations are within 
sampling range of the more exact estimations, even if they show a sizable 
arithmetic difference from them. This is true in the examples given by 
Finney. 

The question of graphical versus more exact estimates is intimately con¬ 
nected with the design of experiments, for it involves a balance of gains 
through time spent on refinements of estimates from a given experiment 
versus additional experimentation, at various stages of an investigation. As 
stated earlier, Finney has considerable to say on design of experiments in 
this field, yet he gives no guidance on this point, saying that “Only experience 
of the subject and of the experimental technique used can be a sound guide 
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in this matter” (p. 24). This is an issue on which the statistician is frequently 
consulted, and rightly so, by the experimenter; and the discussion of design 
of experiment should incorporate consideration of this point. 

Methods of analysis of the dosage response curve through other equations 
than the normal curve are mentioned but not seriously considered. It is 
stated (p. 47) that none has been found to compare in usefulness with the 
probit transformation. It is true that none has been as fully explored, and in 
this sense Finney’s statement may be accepted. There is perhaps no reason to 
expect a book on probit analysis to deal critically with other methods. It is 
to be hoped, however, that the virtual identification of the problem of bio¬ 
assay with a particular statistical treatment will not lead the applied man 
to the view that this is the "correct” analysis to the problem. Referring again 
to the title, it should be emphasized that this analysis is "a statistical treat¬ 
ment” of the problem, and not Hhe statistical treatment.” Since it is a widely 
used treatment at the present time, it is very useful to have this comprehen¬ 
sive summary of the procedures made available, and at the same time, a 
thorough discussion of the problems in one field where they are applied. 

Review by Joseph Berkson 
Division of Biometry and Medical Statistics 
Mayo Clinic^ Rochester, Minnesota 

T his is a very exceptional if not a unique book. I doubt whether ever be¬ 
fore an entire volume has been devoted to the statistical treatment of one 
particular curve. Confining the specialization still more, only one aspect of 
the function is considered, for it is not with the normal curve as such that 
this book deals, but only with its integrated form. Singleness of purpose is 
further exhibited by the technic of analysis dealt with, for the author gives 
attention, not to the multitudinous sorts of mathematical methods that con¬ 
ceivably can be applied to the curve, but to one particular type, which is 
called "probit analysis.” Within the probit scheme he largely confines himself 
to the statistical methods advanced by R. A. Fisher and his students. And 
finally, the data considered are only those of biologic assay. 

Such a concentration of topic and method affords an opportunity for thor¬ 
ough and careful presentation, and this opportunity the author has embraced 
and exploited effectively. The book as a whole is well planned pedagogically; 
the simpler elements are expertly separated out for initial presentation, and 
step by step the more intricate aspects are dealt with in successive chapters. 
The chapter headings may serve to convey this progress concretely: (1) In¬ 
troductory; (2) Quantal responses and the dosage-response curve; (3) The 
estimation of the median effective dose; (4) The maximum likelihood solu¬ 
tion; (5) The comparison of effectiveness; (6) Adjustments for natural mor¬ 
tality; (7) Factorial experiments; (8) The toxic action of mixtures of poison; 
(9) Miscellaneous problems; (10) Graded responses. In Appendix I is given a 
detailed description, in terms of an example, of the computational proce- 
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dures; and in Appendix II an account of the mathematical basis of the probit 
method. There are ample tableSi a good list of references, and a good index. 
I noticed only a few misprints, such as the reference to equation 4.1 on page 
55, which apparently refers to an unnumbered equation on page 53; and the 
year 1940 instead of 1941 given for an article by Garwood. In the list of ref¬ 
erences the absence of the article by Galton^, which presented Shepard’s cal¬ 
culated table of normal deviates, is conspicuous because it is from this that 
the probit tables used today are in all likelihood derived. 

The basis of the method is the linear transform of the integral of the nor¬ 
mal curve, the value of the latter up to X being considered to give the per¬ 
centage response. The probit Y is obtained by adding 5 to X, in order to 
avoid negative values in the use of the transform. The transformation makes 
it possible to represent the relation of the percentage response to the dose 
by a straight line and to pose the problem of fitting as essentially one of 
linear regression. The use of linear transformation for curve fitting itself is 
not novel and is familiar to everyone, at least in the form of fitting an ex¬ 
ponential curve by plotting the logarithm. The method was used even for 
the integrated normal curve long before the initiation of “probits.” How¬ 
ever, rot until Fisher and Bliss devised an appropriate system of weights 
and "working values,” especially the last, was any attempt made to carry 
over exactly into the transformed scale, the computations that necessarily 
are defined in terms of the original values.* 

With a precise linearization thus accomplished, the way was cleared for 
the analysis of biologic assay, the responses of the latter being considered 
to be represented by the integrated normal curve, by applying to it all the 
statistical technics that had been worked out for straight lines. This included 
in its ramifications almost the entire gamut of methods elaborated by B. A. 
Fisher, as the list of chapter headings given previously suggests. Since the 
chapters are comprehensive and clearly written, the volume serves as a good 
text for the use of these methods themselves, apart from their application to 
bio-assay. 

If one considers it timely to offer a didactic text on probit analysis, as here 
delimited, to represent bio-assay, then this book is a job well done. However, 

1 Qalton, Francis. "Grades and Deviates; Inclading a Table of Normal Deviates Corresponding to 
Each Millesimal Grade in the Lenjrth of an Array, and a Figure.” Biomein'ka 5:400-6 Je *07. 

* The consideration of weights is related to what I believe is an erior in the text. On pages 53-54 we 
read: "This quantity is Snw(y —5)»reduoed by [Snw(a; —*) (g -■ S) ]*/Snw(ar —f )*; the calculation is shown 
in Table 6, and x*b 1.67, which differs from the x^of Table 7 only on account of rounding-off eirors.* 
The value of Table 7 refened to, 1.62, is the actual xs of the observations in relation to the fitted curve 
(o *—t)® 

defined slb2 ^—~—' In the probit iteration, at each step, we use, for the regression being determined, 

approximate weights and working values derived from the previously determined regression. It is the 
sum of the squared residuals of the working probits around the fitted line thus falsely weighted that is 
given by the expression referred to The x* directly calculated refers to the actual observations and is 
equal to the sum of their squared residuals with the weights of the present solution. These two sums will 
not be equal, no matter how many decimal places are carried, until finally when the maximum likelihood 
solution is obtained. In fact a signal, and a useful one, as to whether this point has been reached is just 
this comparison. 
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I myself doubt the appropriateness of such a volume. That the probit is a 
handy and generally good-enough tool for dealing statistically with the data 
of biologic assay need not be doubted. If easily applied methods for fitting 
are utilized and if the associated formulas are considered as suggestive only 
and are used with ample safety factors, no harm, and very probably much 
good, can come of its use. But the impression often conveyed by its devotees 
that this particular transform is a thing unique, possessing some intrinsic 
superiority, and that the methods used are mathematically “exact” and there¬ 
for unexceptionable, is open to question. 

Out of some 200 pages of text, the author devotes only about one to curves 
other than the integrated normal curve and says: “None of these relationships 
has yet been found to compare in usefulness with the probit transformation.” 
It would be more accurate to say that no comprehensive examination has 
been attempted of the comparative merits of different transforms and that 
such comparisons as have been made have not been examined adequately 
if at all by the author of the present text. Some studies indicate that there 
are certain advantages in other transforms. The angular transformation, 
for instance, has been found simpler to fit® and the logistic not only easier to 
fit but better in accord with bio-assay data.^ Before a particular curve is 
adopted as the statistical norm, do we not need to know whether it repre¬ 
sents the actual physicochemical mechanism of the bio-assay reaction; that 
is, do we not need a study of the differential equations representing the theo¬ 
retical mechanisms involved? As a minimal responsibility it would seem that 
the author of so definitive a text should himself have ascertained which of 
several proposed curves fits the data best, before committing a public that is 
gullible with regard to any statement printed in a textbook to the curve that 
happened to have emphasis in his statistical schooling. In fairness, however, 
it should be said that the methods worked out for the probit transform can, 
with appropriate modification, often be applied to other linear transforms, 
should it be desired. A statement to that effect would, I believe, strengthen 
the present work. 

The sectarianism shown by the author in regard to the choice of curve is 
exhibited likewise in respect of the statistical methods advanced for apply¬ 
ing it. The method advocated for fitting the curve is that of maximum likeli¬ 
hood, as proposed by R. A. Fisher, rather than least squares, on which most 
of us oldsters were brought up. For the situation of biologic assay as dealt 
with here these two methods do not give the same estimates, as the author 
clearly recognizes (p. 54).® An example for which I have made some neces- 

* Knudsen, Lila F., and Curtis, Jack M. *The Use of tke Angular Transformation in Biological 
Assay.* J Am Stat Am 42:282-96 Je *47. 

4 Berkson, Joseph. 'Application of the Lottie Function to Bio-Assay.* J Am StaA Assn 39: 357-65 
8'44. 

s This apparently has not always been clear to writers on the subject. Thus we find Irwin and 
Cheeseman {Sup J Royal Stat Soe 6:174-85 *39) saying in an article which is often referred to: ' ... 
as is to be e:Q>ected from the efficienoy of the mammum likelihood solution, wMoh in this case is the or¬ 
dinary least SQuare fit.” 
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sary calculations is one used by Fisher and Yates® and for which Garwood^ 
calculated the maximum likelihood solution by the probit method. Four suc¬ 
cessive approximations were required before constancy of the estimates of 
the parameters was achieved. For the first provisional solution the estimates 
of a and h were respectively 4.600 and 0.600; for the fourth solution they were 
4.566 and 0.713, showing an appreciable difference between the first and last 
estimates. Now the likelihood was highest for the last solution, but the suc¬ 
cessive values of x® beginning with the first provisional solution were 3.54, 
3.67, 3.77 and 3.79. The x® test is often used as a measure of goodness of fit, 
and its use is in fact a part of the prescribed procedure of probit analysis. 
By this criterion, in the successive iterations in this example we were la¬ 
boriously moving toward a worse and worse solution! Which solution is the 
"best”: the first, of which the value of x® was smallest; or the last, for which 
the likelihood was largest? Which of the two estimates has the smaller vari¬ 
ance?® And docs not the use of the x® test require that no part of any possible 
excess of x® found be due to the method of estimate used? 

These are not impertinent questions; indeed they must be answered before 
we can say that the basic procedure used in probit analysis rests on a logical 
foundation. For the difference between graphic fits by eye and the exact 
maximum likelihood solution is of the same order as the difference between 
minimum x® and maximum likelihood estimates. If we are to proceed from 
fitting by eye, using a rather laborious arithmetic procedure, toward a more 
refined solution, it can only be because we are assuredly moving toward a 
better solution. But if, as is possible, the minimum x® solution is the better, 
then in seeking a maximum likelihood solution we are, as would be illustrated 
in the case of Garwood, actually achieving a worse estimate than the one 
with which wc started. 

My own questions regarding the probit method for fitting are for the most 
part on an intuitive and “common sense” basis and are little assisted or em¬ 
barrassed by knowledge of the higher mathematics. However, misgivings evi¬ 
dently have been felt by the mathematically mature and competent. Thus, 
speaking of the probit solution, Thompson® said: “A general theorem of con- 

« Fisher. Ronald A., and Yates, Frank. TabUa for Biclogicdl, Atpriniltural, and Medical 

ReseareK Second Edition. Edinburgh, Scotland: Oliver & Boyd, Ltd., 1943. 

7 Garwood, F. *'The Application of Masmum Likelihood to Dosage-Mortality Curves.” Bwtnebrika 
32:43-58 Ja '41. 

* To many the question of which has the smaller variance will appear to be authoritatively answered 
by Finney in the following quotation (p. 209): "Fisher (1922, 1925) has shown that estimates of the 
parameters which maximise the likelihood are efficient in the sense of having minimal sampling variance 
in large samples.” Since for the case in discussion the maximum likelihood and minimum x* estimates are 
different, it would appear that either both their variances are equal or the variance of one is smaller 
and that the one given by maximum likelihood. Inquiry discloses, however, that sudi nmple interpre¬ 
tation reveals a lack of insight into the d^phic implications of En«^ish when spoken by mathematioiaiis. 
For it appears that minimum x* estimates are also "efficient” estimates and apparently it is not in con¬ 
tradiction with the quoted dictum that they may give for samples of say 1,000, estimates with smaller 
variance than those of maxi m u m likelihood. 

• Thompson, William B. "Use of Moving Averages and Interpolation to Estimate Median-Effective 
Dose.” Baeteriol B 11:115-45 Je *47. 
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vergence has not been proved for any set of points (log. pi), or with cer¬ 
tain specific exceptions, where we are required to fit a given curve or a 
straight line in transformed coordinates in the indicated manner. Indeed, the 
general theorem without such exceptions can easily be disproved . . . and ref¬ 
erence to it as 'the exact' method . . . may lead to a false appreciation.” 

Similar doubts will assail one on careful examination of many other phases 
of the probit analysis. On the basis of practical experience I have always felt 
instinctively that the use of observed values of zero or 100 per cent when 
these are asymptotic values is hazardous in curve fitting procedures. Others 
apparently hold similar views; for instance, Knudsen and Curtis in advocat¬ 
ing the arcHsine transform advise against the use of any observations out¬ 
side the range of 0.05-0.95. Cannot the basis of this hesitancy to use the ab¬ 
solute rates be found in mathematical considerations? In assuming that the 
integrated normal curve represents the true quantal responses to increased 
dosage, it is implied that for very large samples, a rate of zero or 100 per cent 
will not occur except with infinite doses. This is of course in marked contra¬ 
diction with actual observations, in which such rates are always encountered 
with finite and even moderate doses. In his basic article on "The case of zero 
survivors” Fisher^® does not evidently make any allowance for this difference 
between theory and fact. But it is not to be taken for granted that this dif¬ 
ference between the mathematical model and reality is without practical 
consequence, and that it may be disregarded without biasing the estimates. 
In view of the practical worker's hesitation to use absolute rates, the burden 
of proof would appear to rest with those advocating their inclusion in statis¬ 
tical schemes. Do we not have the right to require of the mathematical stat¬ 
istician that he demonstrate that the consequences of likely departures from 
theory are negligible, before developments based on the theory are consid¬ 
ered definitely established? 

The fallacious practice of appl 3 dng a test of significance by first offering lip 
service to the doctrine that a nonsignificant P (not low, that is) docs not es¬ 
tablish the hypothesis tested but only fails to disprove it, and then acting 
as if it did establish it, a fallacy that has been warned against repeatedly by 
authorities in mathematical statistics,^^ is illustrated in Finney’s book in the 
use of parallelism of probit lines for estimating relative potency. First paral¬ 
lelism is assumed in order to make the estimates, then a test is performed 
for parallelism on the basis for instance of two degrees of freedom (p. 71). 
Mirahile didu, the P turns out not significantly small. So the estimates based 
on the assumption of parallelism are accepted. Before such a test can be con¬ 
sidered to rest on a sound mathematical foundation we must have some 
knowledge of the power of the test relative to the alternative of nonparallel¬ 
ism. How large a departure from parallelism must exist, I wonder, before the 

10 Fisher, R. A. *Tbe Case of Zero SnrviTors.” An appendix to *The Calculation of the Dosage- 
Mortality Curve,” pp. 13d-67, by C. I. Bliss. Ann Appl Biol 22:164-5 F '35. 

See, for instance, A. Wald's review of M. G. Ifondall's The Advanced Theory of StatietieB, Vol. II in 
J Am Slat Aeon 42:185-6 Mr *47. 
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X* test, as used here on the basis of two degrees of freedom, will probably 
show a significantly small P? And is it not a fact that departure from parallel¬ 
ism much smaller than this wdl render seriously inaccurate estimates of the 
parameters which have been made on the basis of an assumed parallelism? 
Perhaps Finney will have this calculation for us in the next edition of his book. 
Or perhaps he will find that parallelism cannot be tested effectively by means 
of tests of significance based on two degrees of freedom. 

Another matter inviting attention is the tacit assumption made through¬ 
out the text that there is no error in x, the measure of dosage. Actually this 
is an idealization, for to err is human, even in the direction of the abscissa. 
Do we not need a mathematical investigation to ascertain the possible ef¬ 
fect of such errors? So far as I know, no such inquiries have been made, but 
it is not hard to see intuitively that such errors, even if moderate in size, 
may play havoc with many of the formulations set forth as “exact” in this 
book. 

Aside from such specific questions, I sense a broad uncertainty as to 
whether the structure of assumptions as a whole underlying the formulas is 
warranted for the data as they actually are encountered. It is notable 
that this entire book contains not a single repetition of experiment to test 
whether the theoretical error formulas represent the real variations which 
occur in repeated trials. My own practical experience, admittedly small, 
indicates that the formulas underestimate the actual errors considerably 
and sometimes vastly; and this conclusion is confirmed by veteran workers 
in the field of bloassay. Sometimes this sort of objection is met with the 
rejoinder that the formulas give only the “statistical” or “sampling error”; 
but, if dosage to patients is guided by measures of error as given by these 
formulas and a patient succumbs from an overdose, it will be little consola¬ 
tion to reflect that it was not the sampling error that killed him. 

For the reasons cited and other similar ones we cannot be certain that 
probit analysis furnishes the exact method for bioassay nor even that, of 
the methods proposed, it necessarily is the best. At the moment about all 
we can be sure of is that it is the most elaborate. 


Manual de Estatistica. (Manual of Statistics.) Amaro D, Guerreiro. Institute 
Nacional de Estatistica, Portugal. Lisboa; Tipografia Matemdtica, Lda., 1947. 
Pp. 340. 

Review by Hugo Muench 
Professor of BiostalisHca, School of Public Health 
Harvard University^ Boston, Massachusetts 

T he author states that his publication was prompted by the limitations 
of the study of statistics in Portugal, and the scarcity of statistical 
bibliography in Portuguese. He is careful to define the book not as a compre¬ 
hensive text of statistics but rather as a supplement to courses given by 
statistics departments. 
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Given a book of this type, it is probably too much to expect a critical ap¬ 
praisal of the methods described since it is in the nature of a laboratory 
manual or of a mnemonic aid in the performance of calculating procedures. 
Even so, it is disappointing to encounter a wide range of statistical method¬ 
ology without presentation of rationale or caution as to conditions of legitim¬ 
ate use. It is even more disappointing to find, in the wealth of illustrative 
material, methods applied to conditions not fulfilling the underlying assump¬ 
tions under which such methods are justified. 

Normal correlation is, of course (and properly, within due limits of cau¬ 
tion) applied to material whose distribution is not strictly normal. From the 
pedagogic viewpoint, however, it seems to this reviewer that illustrative ma¬ 
terial should conform as closely as possible to prescribed conditions. It is 
therefore something of a shock to find methods of normal correlation and of 
partial and multiple correlation extensively illustrated by material which is 
inherently nonnormal. Incidentally, Pearson's “eta” is presented as a measure 
of “curvilinear correlation.” 

To begin with, the author’s presentation of distributions seems confused. 
Concepts of meso-, lepto- and platykurtosis are illustrated by a graph of 
three distributions, all apparently normal but with differing scatter. Later 
on, relationships between quartiles and standard deviations are stated as 
valid for “symmetrical” distributions rather than for normal, and a series of 
measures of asymmetry and kurtosis is presented without discussion. 

The lack of critical presentation is most felt, perhaps, in the long chapter 
devoted to index numbers, though its absence is felt also in the two final 
chapters devoted to curve fitting. The first of these, entitled “Analysis of 
Time Series” is given to methods of fitting curves by differences (the strange 
statement is made that “if first differences are normal, the logistic should 
be used”); by least squares (parabolas to the third order); by selected points 
(logistic); and by least squares of logarithms (exponential and hyperbola). 
Smoothing by the use of moving averages is discussed; cyclic functions are 
nowhere mentioned. 

The final chapter presents the fitting of various frequency functions. Bi¬ 
nomial distributions are somewhat vaguely linked to the normal curve which, 
for some reason, is transformed so that its maximum ordinate is set equal to 
one, rather than its area. The Pearson system of curves is not explained, 
but Elderton’s table of these functions and their characteristics is included 
and a Type I curve is fitted at some length. 

Three appendices present tables of chi-square, of functions of the normal 
curve (ordinate *»1 when x «0) and of the log Gamma function. There is an 
extensive table of contents, as well as indexes of tables and of graphs, but 
there is no alphabetic index. The table of errata fails to mention at least one 
mistake; the writing of the normal curve function on page 240. 

Mr. Guerreiro’s book should be recommended only to such students as 
have already acquired a good background in the use of statistical method 
and who therefore would be able to select, from the material here presented, 
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procedures applicable to the nature and the limitations of their data. Taken 
by itself it would in no way prepare a student to follow a book such, for 
example, as Jorge Kingston’s A Tearia da Indugdo EstcUisHca. 

An Outline of Statistics, Third Edition. Samud Hays (Formerly Lecturer in 
Commercial Subjects, Doncaster Technical College, Doncaster, Yorks, England). 
London S. W. 19: Longmans Green and Co. Ltd. (43 Albert Drive), 1947, Pp. vii, 
254. 8s. 6d. (New York 3: Longmans Green and Co. [55 Fifth Ave.]. $2.25.) Two 
retnews follow: 

Review bt Harold Nisselson 
Statistidanj Bureau of Census^ Washington, D. C. 

T his is the third edition of an English primer written for ^Commercial 
Students in our Technical Colleges” and the “general reader of economic 
literature.” To this end, it is intended to cover “the requirements of the 
National Certificate in Commerce and the various Accountancy examina-^ 
tions.” 

Roughly two-thirds of the text is devoted to a nontechnical discussion, 
in the traditional spirit, of descriptive methods in statistics. Of the re¬ 
mainder, half is given over to a consideration of published sources of statistics 
on England and the United Kingdom, and the balance to a set of routine 
exercises (with answers). The principal revisions consist of a new chapter 
dealing with rounding errors, an appendix on quality control, and the in¬ 
clusion of postwar sources of data. Since the book is almost pocket-size, the 
amount of space that could be devoted to any one topic is fairly limited. This 
is reflected in the progressively sketchier treatment of later topics; in par¬ 
ticular, time series, correlation and index numbers. However, the style is 
much more agreeable than this brevity would indicate. 

The remaining comments apply to the book as a text in statistics, rather 
than its adequacy for the examinations cited. The chapters on official and 
other published sources of data for England, and an additional one on sta¬ 
tistical measures (unit costs, turnover, etc.) in common business use, are the 
most valuable sections of the book and should prove very useful. By contrast, 
the technical material is far less satisfactory. In general, the approach to 
statistics is one of merely summarizing a set of data. Hence, it is considered 
a limitation of the arithmetic mean that for discrete variables (e.g., number 
of children per family) it may not “represent an actual item at all”—^the 
mean being interpreted as an observation rather than as a parameter of the 
population. There is almost no mention of the role of sampling variation in 
interpreting statistical data, and the choice of the few instances where such 
reference is made is quite unexpected. Thus, a formula is given for the 
probable error of a mean or total due to rounding of the individual figures 
involved, but there is no mention of the probable error of a sample mean 
or total as an estimate of the corresponding quantity in the population 
sampled. (The probable error due to rounding as given is incorrect, and the 
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statement is also made that the probable error is "the most likely value of 
the error to occur.”) Again, a two-page appendix explains the probability 
that a population proportion lies in a specified range based on the proportion 
observed from a sample; and an unsatisfactory cookbook rule is given for 
judging the significance of an observed correlation. The examples cited here 
are typical of a number of misstatements and unguarded remarks on meth¬ 
odological material which appear at all levels of sophistication. 

For teaching purposes the unconventional terminology is awkward: for 
example, "coefficient of dispersion” for the coefficient of variation, “ratio of 
variation” for the regression coefficient, definition of the first moment of a 
set of numbers as the mean deviation and the second moment as the stand¬ 
ard deviation. Moreover, even for a one-semester introductory course, exten¬ 
sive supplementation by lectures would be necessary. 

In future editions, the usefulness of this book could be greatly enhanced 
by pruning or eliminating entirely the chapter on rounding errors, and, if 
necessary, issuing the exercises separately. The space gained in this way 
could then be devoted to developing and illustrating the basic ideas of sta¬ 
tistical inference, and the relations between a sample and the population on 
which it is to provide information. 

Review by Edgar Z. Palmer 

Professor of Statistics and Director of Bureau of Business Research 
University of Nebraska, Lincoln, Nebraska 

T he preface to this little book states that it "is intended primarily for 
Commercial Students in our Technical Colleges and for students inter¬ 
ested in professional examinations.” It may be described as British in form 
and somewhat American in approach. The total amount of material included 
is scanty compared with most textbooks. The usual topics of collection, 
tabulation, and “diagrammatic” representation are touched upon, as well as 
averages, dispersion, index numbers, and correlation. In the chapter on 
time series most of the attention is ^ven to the seasonal index. 

The book gives evidence of careless preparation. It is stated that a death 
rate of 12.5 means a value anywhere between 12 and 13 (p. 22); a frequency 
polygon is smoothed by the use of a weird collection of curves joining the 
tops of the bars (p. 62); the third moment measure for skewness is given in 
terms of the cubes of the deviations, without indication of summation (p. 
98); moving averages, plotted at the ends of their periods, arc supposed to 
illustrate the smoothing process (p. 106); a square root sign appears without 
a figure under it (p. 205). Formulas are a crude combination of words and 
symbols, and fractions are mixed with decimals in a confusing manner. Sam¬ 
pling and quality control have been relegated to two appendices, the former 
topic including only the sampling error of a proportion, and the latter in¬ 
troducing for the first and only time the sampling error of the mean. 

It is probable that for the quality of student for which this is intended it 
is quite teachable. The writing is clear, and illustrations of computations, 
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tables, and graphs are generous. The only interest for American readers is in 
five short chapters on the sources of British statistics, including one chapter 
each on vital, employment, overseas trade, and price data. 


Market and Marketing Analysis. Myron 8. Heidingsfield (Assistant Professor 
of Marketing. Temple University, Philadelphia 22, Pa.) and Albert B. Blan- 
lenship (Managing Director, National Analysts, Inc., 15th and Locust Streets, 
Philadelphia 2, Pa.). New York 10: Henry Holt and Co., Inc. (257 Fourth 
Ave.), 1947. Pp. x, 335. $3.00. Two reviews follow: 

Review by Lester R. Fbankel 
Statistician, Marketing and Research Division 
Dun A Bradstreet, Inc., 290 Broadway, New York 8, N. Y. 

T he use and application of marketing research to the solution of business 
problems has received general acceptance during the past few years. Since 
many of the procedures used in this field are based upon somewhat special¬ 
ized statistical techniques, a warm welcome would greet a book, such as this 
one, whose purpose is to elucidate some of these methods. 

In spite of the excellent choice of topics, the orderly presentation of sub¬ 
jects and the simplicity of exposition, this book, as it now stands, fails to 
serve the above purpose. This results from the fact that the text contains 
an unduly high proportion of mistakes, misinterpretations, and confusing 
statements. Some of the errors can be detected by the bright student or 
astute businessman. Other statements are inconsistent with correct statisti¬ 
cal usage. Businessmen with some knowledge of accounting procedures will 
find some of the material confusing. Students of marketing who read the 
book and then decide to specialize in statistics will have to unlearn many of 
the techniques which are incorrectly described here. 

The material covered in the text is presented in a logical and systematic 
order within the framework of five parts. The first part, consisting of three 
chapters, is introductory in nature. It discusses the need of research to the 
efficient operation and planning of a business concern and points out that 
sources of information are available from both internal and external sources. 
Internal analysis, it is explained, makes use of current and past operating 
records of the concern. Such records are usually in the form of balance sheets 
and profit and loss statements. In external analysis use is made of surveys, 
statistical series and other sources outside of the concern. 

The second part deals with the use of internal analysis in the solution of 
marketing problems. Although the study of the operations, problems, analy¬ 
sis, and interpretation of facts pertaining to business concerns is somewhat of 
a science today, the two chapters in this part, “Analysis of Business Records" 
and “The Use of Standards in Internal Analysis," use business terms and ex¬ 
pressions quite loosely. The effect is confusing, whether the looseness is due 
to careless wording or to a lack of detailed knowledge of a very exacting 
subject. 
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On page 41, for example, it is said that ratios measure ^degree of sol* 
vency” and “percentage of borrowed assets.” “Degree of solvency” is not an 
established term in business or the economics of business, and the text does 
not make it clear whether it means that a particular business enterprise is 
50 per cent solvent, 70 per cent solvent, 90 per cent solvent, or exactly 
what. The text goes on to state that “degree of solvency” can be measured 
by the ratio of “assets versus liabilities.” Here we become still more con¬ 
fused, as we are not told what assets or what liabilities are to be included. 
We learn a little later that what is meant is current assets and current li¬ 
abilities, although on page 55, where the ratios are listed in fraction form, 
the fraction again is based on the very indefinite terms of “assets” and “li¬ 
abilities.” The ratio of current assets to current liabilities is no measure of 
such indefinite ideas as “degree of solvency” or “percentage of borrowed 
assets.” It is really a measure of the ability of a concern to meet its immediate 
liabilities on scheduled time, and even as such a measure, interpretation must 
be made with extreme care and from fairly wide knowledge or experience. 
The second expression, “percentage of borrowed assets,” has no meaning. No 
business concern borrows assets. Assets are purchased, and until the pur¬ 
chase price is paid, the liability for the purchase is on the books. The pur¬ 
chase might be raw material to be paid in 10 days, or real estate and building 
to be paid gradually over a period of 10 years. 

In several of the ratios which are mentioned, “net worth” is used as a 
component. Net worth is the dollar difference between all assets and all lia¬ 
bilities. Occasionally a concern has total assets of $1,200,000 of which 
$1,000,000 is an intangible item such as goodwill, trade marks, or formulae. 
If the $1,000,000 is deducted, we arrive at a term of “tangible net worth” 
which should be used in these compilations. Otherwise homogeneity, which is 
described as an absolute essential on page 64, is completely lacking. This 
is a matter of very considerable importance and no explanation of the dif¬ 
ference between net worth and tangible net worth appears in the text. As a 
matter of fact, the term “net worth” is used exclusively in the text. 

The material presented in Part 3 provides a very good summary of the 
use of sample surveys in marketing research. In many respects this part is 
the best in the book, especially from the viewpoints of clarity and accuracy. 
But, even here, we find some errors. For example, sampling is first introduced 
in the chapter on “Planning the Survey Operation.” At the beginning of this 
chapter (p. 83) we find the statement that, in sample surveys, “The impor¬ 
tant thing here is to make absolutely certain that those selected to represent 
the total population have all the social, economic, and other characteristics 
which typify the ori^nal group.” A few lines later we find the statement, 
“Statistically, this is usually secured by what is called a random selection.” 
Of course, in random selection we are never absolutely certain that the 
sample has all of the characteristics of the population. In random selection, 
we rely upon the statistical law of large numbers to approximate this objec¬ 
tive. 
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The issue of area versus quota sampling is discussed in the same part. It 
was gratifying to find that through an objective analysis of the two methods 
the authors succeed in conveying to the readers the fallacies of quota sam¬ 
pling. However, it is unfortunate that the authors persist in using the term 
"stratified sampling” sjTionymously with "quota sampling.” 

In the remaining chapters of this part we find excellent treatments of the 
problems of questionnaire development, and of the collection and tabulation 
of the data. We are given the benefit of the authors’ wide experience in this 
field. 

"Special Applications of Eesearch Technique” is the title of Part 4, which 
includes a description and evaluation of the methods of establishing sales 
potentials and quotas, and a discussion of some of the problems of radio re¬ 
search. It is interesting to note that, in commenting upon the use of cor¬ 
relation analysis in determining market indexes, the authors state (p. 229), 
"This method has the disadvantage of requiring a trained statistician.” 

The part of the book dealing with "Technical Statistical Procedures” is 
the part containing most of the errors. Some of the errors are due to careless¬ 
ness and others are due to the authors’ eagerness to describe techniques 
which they evidently have never used. The first chapter in this section de¬ 
scribes fundamental statistical techniques, material which is found in all 
elementary texts on statistics, and yet there are a number of errors. For 
example, in describing the methods of computing the arithmetic mean and 
median, a series of 33 numbers is grouped in the form of a frequency table. 
In computing the mean the midpoint of the class interval, say 13 to 15, is 
taken at 14. However, in calculating the median we find the statement, "The 
actual range of the scores represented by the 13-15 class interval includes 
those that arc barely above 12 and those extending up as high as 15 but no 
higher.” The class interval has shifted one half of a unit. However, the cor¬ 
rect answer is arrived at in this particular instance through the commission of 
another, but compensating, error. The position of the median item is com¬ 
puted by taking one half of the total frequency plus one instead of one half 
of the total frequency. 

There is confusion in the distinction between the standard deviation and 
the standard error. It is stated on page 273 that “The standard error, nu¬ 
merically, means just about the same thing as the standard deviation.” If 
50 per cent of the public has done a certain thing "there is no way to compute 
a standard deviation of such a figure.” The only thing one can do, sa y the 
authors, is to get the standard error which is given by the familiar 
formula. The distinction between statistical description and statistical infer¬ 
ence IS not recognized. 

The treatment of “Sample Size and Accuracy” is somewhat puzzling. It is 
emphasized at the outset that statistical formulas for sampling errors can be 
applied only when the sample has been drawn on a random basis. We are 
warned that the formulas do not apply in quota sampling and that modifica¬ 
tions of the random sampling formulas have to be made when stratification 



160 


AMERICAN STATISTICAL ASSOCIATION 


or area sampling has been employed. However, the authors state that once 
data have been collected, regardless of the particular sampling used, the 
sampling error can be computed by established techniques! They have 
evidently fallen into this fallacy because of their ignorance of the difference 
between the standard deviation and the standard error. 

In a table on page 2S0 illustrating the cut-off method for determining the 
size of an adequate sample we find that among 50 interviews 49.2 per cent 
answered “yes” to a particular question and that among 100 interviews 50.1 
per cent answered “yes.” Since such results are arithmetically impossible 
when each interview represents a single individual, we are led to believe 
that an interview may represent several individuals However, there is no 
indication in the text that this is the case. 

On page 290 the following statement appears: “The midpoint between the 
perfect positive and perfect negative correlation is indicated statistically by 
a correlation of .00. This means that there is no relationship whatever be¬ 
tween the variables being analyzed.” What should have been said is that 
there is no simple linear relationship between the variables. 

It is stated on page 295 that, “Mathematically, it is seldom sound to . . . 
[compute a linear correlation coefficient] with fewer than 100 cases.” Pro¬ 
vided the correct test of significance is used, it is sound to compute r with as 
few as three cases. 

On page 305 in discussing the uses of the chi-square test it is suggested 
that, “It might be used to determine the significance of sales estimates (as 
incorporated in the sales budget) against sales returns.” The comparison 
between an actual series and a theoretical series through the use of chi- 
square can be made only when both series consist of frequencies. 

It is unfortunate that this book did not receive a thorough review prior 
to its publication. The layout, presentation, and style are above par for a 
text of this type. However, because of the great number of errors present, 
it is not very well suited for classroom or general use. It is hoped that a re¬ 
vised edition of the book will soon appear. 

Review by Philip J. McCarthy 
Assistant Professor of Sociology, Cornell University 
Ithaca, New York 

T he topics considered in this book fall into three categories: the use of 
internal company records for market research, the application of survey 
techniques for obtaining data external to the company, and the statistical 
concepts which are frequently required in considering either or both of the 
other two topics. All three are well illustrated with practical examples. The 
Preface states that the book has been prepared with intentional simplicity 
for students in marketing research and for businessmen who buy or other¬ 
wise make use of commercial research. This, in itself, is a desirable character¬ 
istic of a book for beginners. However, when considered in conjunction with 
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the fact that much of the material on survey techniques has already been 
presented in a simple and more exhaustive manner by one of the authors,^ 
and with the fact that statistical techniques are very difficult to treat in a 
simple manner, this simplicity becomes the greatest shortcoming of the book. 

The discussion of internal company records as a tool for research is an 
innovation for a marketing research book. As such, it should prove particu¬ 
larly interesting and valuable for those individuals who have entered the 
field of commercial research through academic disciplines other than eco¬ 
nomics or business administration. However, it is doubtful whether the stu¬ 
dent of marketing or the businessman will find this approach new. Most 
students of marketing will already have covered the field of internal record 
analysis in more detail in their courses in business statistics and accounting, 
and it seems likely that the businessman would have long since been forced 
into a careful study of the internal records of his company. 

Survey techniques are skimmed over lightly, the presentation being very 
similar to and less detailed than that of the previously noted book by Blan¬ 
kenship. The authors do maintain a critical attitude throughout this section, 
and their comments on methods and techniques are unusually well chosen 
and to the point. They have perhaps leaned over backwards to stress the 
use of random methods in the design of sampling procedures, but considering 
the widespread use of other methods in marketing research, this can certainly 
do nothing but good. They do attempt to wrap all sampling methods up into 
four neat packages (an impossible task), and they do say nothing about 
how the businessman or the commercial research organization should deter¬ 
mine the limits of error which can be tolerated in any given survey operation. 
Unfortunately there is very little information on this topic, and some con¬ 
sidered thought needs to be devoted to it. 

The final topic covered in the book, namely statistical techniques, is an¬ 
other attempt to convey statistical understanding to persons without any 
mathematical background. The present treatment is not distinguished in any 
way, and when one considers that measures of central tendency, measures 
of dispersion, sample size and accuracy, correlation, analysis of variance, 
analysis of covariance, factor analysis, chi-square and sequential analysis 
are all touched upon in the short space of fifty pages, there is serious reason 
to doubt whether this section will serve any useful purpose. The interested 
reader might better be referred to some such book as How to Read Stalistica 
by R. L. C. Butsch (see reviews in this Journal, December 1940). 

O Gusto de Produ^SLo do Homem Adulto e Sua Varia^So em RelagSo k Mor- 
talidade. (The Cost of Production of the Human Adult and Its Variation in Rela¬ 
tion to Mortality.) Giorgio Mortara (Technical Advisor, National Census Com¬ 
mission, Brazilian Institute of Geography and Statistics, Rio de Janeiro, Brazil). 
Estudos Brasileiros de Demografia, Monografia No. 2. Rio de Janeiro, Brazil: 
Livraria Kosmos EditOra, 1946. Pp. 152, 40 cruzeiros. Paper. 

I Blankenship, Albert B Consumer and Opinion Researefi, New York: Harper dt Brothers, 1943. 
F;p. xi, 238. 
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Review by T. N. E. Gbbvillb 

Principal Actmrial Mathematician^ National Office of Vital Statistics 
U. S. Pvblic Health Service^ Washington, D. C. 

T his is the second of a series of monographs entitled “Brazilian Demo¬ 
graphic Studies,” being published under the auspices of the GetiSlio 
Vargas Foundation. Pareto in 1893 (Giomale degli Economisti, 1893, 2nd 
half- 3 ^ear, pp. 451-466) discussed the cost of production of a human adult, 
defined as the total cost of upbrin^ng of an entire generation from birth 
to age 15 or 20 (including the children who fail to survive) divided by the 
number of survivors at the selected age. Finding this cost to be only slightly 
less in countries with low infant mortality than in those having high infant 
death rates, Pareto erroneously concluded that a low infant mortality tends 
to be offset by a high death rate in later childhood and adolescence. The 
author of the present monograph conclusively disproves this contention, 
marshaling a mass of data derived from life tables for a number of countries 
at various periods to show that a reduction in infant mortality has invari¬ 
ably been associated with a corresponding reduction at subsequent ages. 

Postulating a scale of relative living expenses by age increasing in geo¬ 
metric progression from 26.79 in the first year of life to 100.0 for the twen¬ 
tieth and each subsequent year, he obtains values, in terms of the conven¬ 
tional units so defined, for the total cost of production (up to age 15) of a 
human adult in various countries and epochs, including the propoitionate 
share of the cost occasioned by those who died prior to age 15. Subdividing 
the result into two parts, corresponding to the proportionate share of the 
cost of upbringing of the survivors and nonsurvivors separately, he shows 
that even in those countries having the highest infant mortahty, the latter 
portion of the cost constitutes only slightly more than ten per cent of the 
total. This figure represents, therefore, the maximum reduction in the total 
cost that could, in theory, be effected in a given country by reducing the 
death rate in infancy and childhood. However, if one considers separately 
the percentage change in the portion of the cost which relates to the nonsur¬ 
vivors, there has been a spectacular reduction. For example, it was 64 per 
cent for white males in the United States between 1900-1902 and 1939. 

On the basis of available data, and making certain assumptions, the author 
attempts to illustrate his results in terms of concrete monetary values, esti¬ 
mating the average cost of production of a human adult under the economic 
conditions existing in Brazil in 1939 to have been between 8,700 and 8,750 
cruzeiros. At the rate of exchange then prevailing this would be equivalent to 
approximately $460 in United States money. 

The stj^le is lucid, and the argument convincing. In general, the conclu¬ 
sions stated appear to be amidy supported; in fact, the documentation, in 
some instances, is perhaps unnecessarily full, as some of the main conclusions 
are today hardly open to dispute. The author promises that a subsequent 
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monograph of the series will deal with the average productivity of an adult 
during his lifetime and its variation in relation to mortality. 


Forecasting Sales. 0. Clark Thompson, Conference Board Reports, Studies in 
Business Policy, No. 25. New York 17: National Industrial Conference Board, 
Inc. (247 Park Ave.), 1947. Pp. 47. Paper. Price on application. Two reviews 
foUow: 

Review by Chables F. Rods 
President^ The Econometric InstitviCj Inc. 

600 Fifth Ave., New York 18, N, Y, 

N o LONGEB than ten years ago sales forecasting by most companies was 
confined to tabulating the guesses or hopes of the sales department; as 
late as 1940 only a few companies recognized that their sales are closely 
geared to national or regional purchasing power and other economic variables 
that can be satisfactorily forecast. As the Foreword of Forecasting Sales 
states, 

Until recent times, most companies have contented themselves with crude 
appraisals of the general business outlook. The gradual development of 
economic and marketing research techniques, however, have provided 
businessmen with an opportunity to make more accurate sales forecasts 
than had previously been possible. Today, numerous companies regularly 
forecast annual sales within 10% of actual performance and often come 
within 1 %. 

A reahzation of the value of such forecasts, together with increased con¬ 
fidence in present-day forecasting techniques, has encouraged many com¬ 
panies to attempt to improve their methods or to develop new ones. 

The main purpose of this report is to illustrate forecasting techniques cur¬ 
rently being used by companies in such diverse fields as food processing, 
metal fabricating and manufacturing, air conditioning equipment, furniture, 
electrical equipment, chemicals, rubber products and building materials. 
The methods illustrated for forecasting sales include the polling of executive 
and sales opinions, correlation analysis, trends and cycles and the use of 
mathematical formulas which last in reality involve the use of statistical 
complexes and accounting techniques. Actual examples are used to develop 
the techniques and, consequently, they are characterized by uneven degrees 
of statistical refinement and imagination. 

The first example presented under the section devoted to statistical meth¬ 
ods is that of a metals company. Joint study by outside consultants and a 
company specialist revealed a definite relation between the sales of the com¬ 
pany’s products and various special groupings of industries included in the 
Federal Reserve Board index of industrial production. The example pre¬ 
sented correlates the unit sales of Product “A” with the Federal Reserve 
Board index of industrial production and the ratio of automobile production 
to this index. Thus, the approach avoids the pitfall of using correlated varia¬ 
bles in a multiple correlation. The ‘^fit” between actual and estimated sales 
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is quite good, as it should be, because the product is sold to many industries 
but in volume to the automobile industry. This means that if the company 
can secure reliable forecasts of the FRB index of industrial production and 
of automobile production it can make reliable forecasts of the sale of Product 
“A.” The company obtains these general forecasts from a private research 
organization which specializes in this work. 

Another example compares the plumbing sales of a manufacturer with 
construction contracts. The agreement, of course, is generally good. How¬ 
ever, there are important deviations which seem too large to be ignored. The 
reviewer has found that these plumbing sales can be explained more satis¬ 
factorily in terms of Supernumerary Income (disposable income less cost of 
subsistence living) and construction contracts. The Supernumerary Income 
is the purchasing power available for purchases of plun bing supplies for 
modernization. This income is sometimes called Discretionary Income. 

The projection of trends is the method used by an electric power company. 
It projects the trends of (1) population, (2) meters or customers, (3) electric 
power consumption per customer, and (4) revenues. The trend of each is stud¬ 
ied separately and also in relation to the other three. As crosschecks or links 
the company also studies (a) the meter ratio (number of people per meter), 
(b) sales per meter, (c) revenue per meter, (d) revenue per unit, and (e) reve¬ 
nue per unit vs. consumption per meter. With the aid of mathematical for¬ 
mulas and a certain amount of judgment the trends of the link factors are 
projected. 

Industry replacement sales of automobile tires are forecast by a mathe¬ 
matical formula which includes the number of automobiles over two years 
of age, consumption of gasoline per car of these cars, a factor measuring tire 
quality, and sales of camelback, the unvulcanized rubber used for recapping. 
A curve can be readily constiucted from life tables to show the number of 
cais over two years old, the tire quality is known, and sales of camelback 
are not large enough to introduce much error. Consequently, the principal 
problem is that of forecasting gasoline consumption. For this purpose the 
company projects the trend of gasoline consumption per car by mathematical 
extrapolation modified to allow for special conditions which will, in the fore¬ 
caster’s judgment, affect the demand in future years. Much more accurate 
forecasts of gasoline consumption per car can, however, be made by relating 
this variable to disposable income and gasoline price and forecasting disposa¬ 
ble income. Gasoline price is related to this income, taxes, and refinery costs. 

Some companies use several methods for forecasting sales and then recon¬ 
cile the forecasts in meetings of executive or planning committees. In such 
instances, the forecast has broad authority and acceptance throughout the 
organization. While it is recognized that the forecast may still be wrong, it 
is accepted as the best estimate the organization can make based on eco¬ 
nomic and statistical information as checked by the business experience of 
the company’s top oflScers. 

It is now possible for companies to take the next step and plan sales budg- 
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ets and allocations by counties. The OflSlce Equipment Manufacturers’ Insti¬ 
tute has arranged with the Social Security Board to tabulate for each county 
in the United States employment, pajTolls and the number of establishments 
broken down to size groups. These will be provided for the main classifications 
of mining, manufacturing, wholesale trade, law offices, medical and health 
services, and nonprofit organizations. It will now be possible for companies 
to make studies of how their sales gear to these county data and then by the 
use of tabulating machines to set realistic sales budgets by counties and sales 
territories. 

This trend toward better forecasting, in the reviewer’s opinion, will prob¬ 
ably lessen the intensity of business cycles and will certainly lead to the 
dampening or elimination of seasonals. This is especially important at a 
time when union leaders are beginning to press for guaranteed annual wages. 

Review by II. D. Wolfe 
Director^ Market Research Department 
Colgaie-PalmoUve-Peet Co., Jersey City, N. J, 

T his small pamphlet on methods and practices of forecasting sales is one 
of the most helpful and illuminating contributions ever made on this 
most vital subject. Much has been written about distiibution costs and the 
fact that they may or may not be too high. Thompson, through exposition 
of method buttressed by actual cases of the practices employed by fifteen 
large industrial companies, demonstrates how accurate sales forecasts can 
lower distribution costs. Costs of production, selling prices, and profits are, 
of course, dependent on sales volume. The sales forecast of volume (either 
dollar value or phj’sical volume) affects the proper purchasing of production 
and packaging supplies, the finances needed, transportation facilities for 
which provision must be made, and influences the selling and advertising 
effort that must be expended. An accurate sales forecast can, therefore, result 
in the most efficient utilization (hence the lowest cost) of all the factors in- 
volvcd- 

There is a minimum of reference to actual statistical methods employed 
to achieve accurate sales forecasts. The reader is referred, in the appendix, 
to some of the standard works on statistical methods and business-cycle 
theories. The value of this pamphlet lies in its collation and description of 
the methods in actual use in 107 companies in the United States and Canada. 
In addition, rather complete case histories of particular methods utilized by 
15 representative companies are presented throughout the pamphlet; empha¬ 
sis is placed on the importance and the necessity of critically examining 
forecasts based on statistical methods. Judgment, experience, and the test 
of reasonableness must be applied to every forecast before the forecast be¬ 
comes an official figure of the company. As the author, in analyzing correla¬ 
tion methods so well puts it, "the more reliable the correlation, the more 
dangerous its use becomes, since there is a tendency to place blind faith in a 
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method that seems infallible. Even the best correlations are’^subject'*'to 
chance variationSj and one abrupt or severe deviation from normal may be all 
that is necessary to bring severe losses to a company.” 

Every company, no matter whether small or large, and whether or not 
they make official or formal sales forecasts, would be well advised to have 
this pamphlet read by every employee directly concerned with sales, ad¬ 
vertising, production, or financial budgets. No matter how advanced may be 
the thinking within a particular company with respect to sales forecasting, 
there is much material in the pamphlet which could probably properly be 
used in conjunction with the methods now employed by any such company. 

Specifically, the pamphlet describes the following methods of sales fore¬ 
casting, which may include national and territorial sales forecasts by product 
lines. 

Nonstatistical Methods, (a) The Jury of Executive Opinion. Under this 
method, top executives combine their experience and knowledge to arrive 
at an acceptable figure for budgeting sales. (&) The Sales Force Composite 
Method. Estimates are gathered by divisional managers from each of their 
salesmen and sent to the home office for examination and reconciliation by 
the top sales-management group in the home office. 

Statistical Methods (Correlation Analysis, Trends and Cycles, and 
Mathematical Formulas). When statistical methods are employed, the au¬ 
thor specifically recommends that such methods be kept simple as possible in 
order to assure their acceptance by the operating men who are to use the 
forecasts in the conduct of the company's business. If the factors affecting 
the sales of a particular company are complex and thus require complex 
statistical methods, the author recommends that the forecaster "devise 
means of instilling the necessary understanding and confidence in the ulti¬ 
mate users.” Constant review and revision of the forecasts must be made 
after the period of forecast has begun in order to check the actual perform¬ 
ance with the forecasted estimates and to adjust for any change in conditions 
that may have occurred after the forecasts were made. 

Multiple Method Approach. Under this approach, forecasts made by 
nonstatistical methods are compared with the forecasts arrived at from 
statistical analysis. The multiple method is highly recommended since it 
combines all the experience and knowledge within the comijany. As the 
analysis states, "the critical examination of differences between two or more 
independent forecasts sharpens executive judgment and tends to make the 
final forecast more reliable.” 

The only criticism of the pamphlet is the lack of recognition of the use of 
growth curves for short-term forecasts. Perhaps this method has not yet 
found wide usage, although its value has long been recognized for long-term 
forecasts. 

It is hoped that the author will find time to expand the material in this 
pamphlet as more information becomes available, and as the pattern of the 
postwar economy emerges more clearly. It may well be that prewar statist!- 
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cal correlations, based on a 100 billion dollar maximum economy, may not 
continue within a 200 billion dollar economy. 


Forecasting for Profit: A Technique for Business Management. WiUan Wright 
(Economist, Armstrong Cork Company, Lancaster, Pa.) New York 16: John 
Wiley & Sons, Inc. (440 Fourth Ave.), 1947. Pp. vii, 173. $2.75. (London. W.C.2, 
England: Chapman & Hall, Ltd. [37-39 Essex St., Strand], 1948.16s. 6d.) Two 
reviews follow: 


Review by William A. Spurr 

Professor of Business Stalislics^ Stanford University, Stanford, Calif(yrnia 

T his is an essay touching on many topics in business forecasting. The aim 
is to show the business executive, the business economist and the student 
how to predict a company's sales, profits, and prices. Toward this end Mr. 
Wright discusses the need for forecasting, types of forecasts, the factors 
which determine sales and profits, a method of making a general business 
forecast, the uses of national income data, the nature of money, the statis¬ 
tical series used by the Armstrong Cork Company, the effect of competition 
on a company's outlook, business cycles, "economic models," mechanical 
forecasting devices, types of price forecasting, the determination of turning 
points in sales and profits, criteria for classifying products, and the final as¬ 
sembly of the sales forecast. 

Mr. Wright thus points out the principal problems faced by the forecaster. 
His discussion appears plausible and persuasive. His viewpoint is that of a 
practicing business economist. However, I fail to see how one could learn 
how to make a business forecast from reading this book. There are two fun¬ 
damental criticisms. 

1) The treatment is too superficial. Each problem is merely mentioned in 
a general way, rather than being carefully analyzed and explained. Further¬ 
more, the topics are heterogeneous and unorganized, and terminology is 
vague. Definitions are fuzzy, and sometimes inconsistent with ordinary us¬ 
age. One gets the impression that the book was dictated in a very short 
time. As further clues to the author's thoroughness, the footnotes, bibliogra¬ 
phy, and index are rudimentary; the few charts show only hypothetical 
data, and the very few tables of actual data are brought up only to 1944. 

2) There are six essential elements in any good forecasting book; none of 
which is developed adequately here: (a) an explanation of pertinent sta¬ 
tistical methods, (5) business cycle description and theory, (c) a critical ex¬ 
amination of leading business indexes, (d) analysis of principal noncycUcal 
factors affecting business, (e) historical survey of recent economic fluctua¬ 
tions, (/) a detailed case study of a specific forecast, showing how the above 
principles are applied in practice. A business economist should do this last 
job well, but despite Mr. Wright's many references to his work at the Arm¬ 
strong Cork Company, he still fails to get across to this reader just how they 
make a forecast. (The treatment in pages 133-36 is just a start.) 
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As one illustration of both these criticisms, the statement is made on page 
83, that, “ it is helpful to break down or decompose by statistical manip¬ 
ulation an index of business activity into component elements.” Very well, 
then what methods of time series analysis are most appropriate and how may 
they best be applied in practice? Mr. Wright's only explanation is, “Statis- 
tistical decomposition involves calculating the vaiious equilibrium values 
in a series and the mathematical difference of each.” 

The reader, however, should look at both sides of the matter by contrast¬ 
ing this review with one which appeared in the American Economic Review 
for December 1947 (pp. 974-77). The reviewer there outlines Mr. Wright's 
thesis at some length and calls it ^competent and thorough. .. well or¬ 
ganized, logical.” 

To resume: if this treatise is lacking, then what is a good book on business 
forecasting? The prewar books are now rather dated, unfortunately, because 
of the rapid rate of obsolescence in the data, methods and strategic factors 
used in forecasting- Three books have been published since the war: (1) 
Dewey and Dakin's periodicity study, Cycles: The Science of Prediction 
(Henry Holt and Co., Inc., 1947),which is definitely misleading, in my opin¬ 
ion. (2) Wright's Forecasting for Profit, which is innocuous but thin. (3) 
Bratt's BtLsiness Cycles and Forecasting, Third Edition (Richard D. Irwin, 
Inc., 1948) which I would recommend as the only adequate treatise of the 
three. 


Review by Victor von Szeliski 
Vice President, U.S. Economics Corporation 
10 Rockefeller Plaza, New York 20, N. Y. 

T his book is recommended to business managers and to professional and 
academic economists. It is not so much a description of forecasting de¬ 
vices as an inventory and broad description of those parts of the social sci¬ 
ences which are useful to practicing business economists. A more descriptive 
title would be “Role of Forecasting in Business Management and the Kit of 
Social Science Tools Needed by the Company Forecaster.” 

The author correctly regards forecasting as a practical art and craft, in 
this respect disassociating himself from the school which regards it as a de¬ 
partment of mathematics. Forecasting is trying “to understand and desciibe 
the specific results of economic and political action taken by human be¬ 
ings. .. . The tools of the useful practicing economists are statistics of in¬ 
determinable accuracy, a knowledge of how people have acted in the past 
in situations which have been similar in some respects, a disciplined mind, 
knowledge derived as a result of critical observation, imagination, and com¬ 
mon sense.” So it is not surprising to find Mr. Wright recommending in 
the bibliography books by Pareto and Sorokin (but the reviewer does not in¬ 
tend to follow Mr. Wright's bibliography to the extent of reading the book by 
Major Angas). 
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The opening pages on the role of forecasting in business today and the po¬ 
sition of the forecaster-economist vis-i-vis management are a necessary in¬ 
troduction to any book on practical forecasting. 

In a brief paragraph on page 159, the author describes how forecasting has 
been integrated into the operations of the Armstrong Cork Company through 
a forecasting committee, how forecasting has been assimilated into the regu¬ 
lar operating routine. Perhaps Mr. Wiight will expand this tantalizing 
paragraph at some future time and explain the mechanics of how forecasts 
are translated into action in purchasing, sales, plant construction, and iSscal 
management. 

The key forecast, in conformity with which an individual company’s 
decisions have to be worked out, is the sales forecast. The author takes 
up in Chapter 3 the prerequisites necessary for developing a sales fore¬ 
cast, emphasizing the distinction between internal or controllable factors, 
and external or uncontrollable factors. Analysis of the external factors con¬ 
trolling company sales usually shows that the general business cycle is the 
dominant factor in company sales. 

The first step in business cycle forecasting is making an “estimate of the 
situation,” which is an evaluation of the strains or unbalances currently 
existing and the direction in which the economy would normally move as a 
result of these strains. The reviewer’s description of the book as a list of 
needed tools rather than a manual on how to use them is illustrated by the 
fact that the difficult question of equilibrium and tension is described in less 
than 4 pages. On the basis of the short description given, the author’s 
method of measuring equilibrium (use of trend lines) appears too mechani¬ 
cal. Equilibrium levels can certainly shift in ways which are not adequately 
measured by trend lines, and which can be explained statistically only by 
taking account of a number of factors. 

The author’s use of expectations is interesting—^the more so as reports 
on the expectations of business executives now appear in Fortune. 

More hints on diagnosing economic tension are given in Chapter 13. 
Maladjustments and disorganization within the supply of commodities are 
looked for; the magnitude of consumers’ stocks of durable commodities 
are to be appraised; their probable influence of prolonged trends are to be 
evaluated. Important recession and economic activity can be looked for 
when it is evident that excess productive capacity has been developed in 
a number of important industries. Residential construction is especially 
important because of the long, useful life of structures. “A large part of the 
success experienced in the practice of business forecasting may be attributed 
to competent analysis of the situation in, and outlook for, the construction 
industry, with particular emphaas on privately financed residential con¬ 
struction.” 

It is then up to the forecaster to figure out when and how fast and how far 
this tendon be released. 
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As a prerequisite for making useful, short-term forecasts, it is necessary 
to have a long-term forecast because the latter will condition the interpre¬ 
tation of the short-term occurrences and the prediction of the duration of 
short-term movements. The interpretation placed upon the expectation of 
business managers will also depend on the estimation of the long-term dis- 
equilibria which have to be corrected. 

Increases in labor costs and of material costs per unit of output may be the 
signal for the release of a tension by a decline in business activity. The au¬ 
thor also thinks that stock prices can be profitably analyzed for "measuring 
the effect of events on the expectations of persons who are highly interested 
in the trend of business activity and the magnitude of profits,’’ Maybe so. 
Mr. Wright regards the message of the stock market as especially important 
when the analysis of tensions indicates that a reversal of the trend of business 
activity is in the cards. 

The author devotes a few pages to economic models. "Being in the vogue, 
these instruments sometimes have been used to dignify illo^cal assumption 
with a mantle of precise arithmetic. Because of this practice, some experi¬ 
enced economists are inclined to lift their eyebrows at the use of these mathe¬ 
matical devices. They can be useful, however, for the purposes of persons 
who are interested in trying to forecast the development of actual situations 
in the economy.” The reviewer believes that the construction of economic 
models is absolutely essential for assuring one’s self that he has included (or 
for forcing one’s self to include) all major relevant considerations. A model 
should be thought of as a mosaic picture or a jig-saw puzzle— a, matrix into 
which an economist can fit the thousand-odd clues collected for his work; 
the individual pieces do not mean very much by themselves but when fitted 
together they make a picture that is clearer than the mere sum of the parts. 
In trying to make them fit, the economist learns, and improves his estimate 
of the situation. At present, however, craftsmanship rather than science must 
be relied upon in this necessary task because the mathematical methods for 
model construction still limp badly. 

The author also has chapters on industry competition and price forecast¬ 
ing. Raw material prices and selling prices arc of immediate interest to in¬ 
dustrial concerns, and so also are equity prices, as indications of how easily 
business can finance new capital expenditures with equity capital—^matters 
which are of extreme importance in those stages of the business cycle where 
the forecast resolves itself into a questionof how much business will spend for 
capital construction. 

The author scarcely discusses at all the opposite problem of how tension 
and disequilibria are created—how and why the economy moves from a posi¬ 
tion of equilibrium to one of tension. From the viewpoint of the science of 
economics, many if not most of the "why’s” are accidents; they are political 
or social in origin. 

Chapters 5 through 7 describe measures of national output, national in- 
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come, the makeup of the money supply, and some of the mechanisms that 
lead to changes in the money supply. Chapter 7 gives in detail the catalog 
of statistics in these and related factors, followed by the Armstrong Cork 
Company as background material from which forecasts are developed. 

The author then considers in Chapter 15 how the national business cycle 
forecast is translated into a forecast of industry and company sales—^by 
correlation techniques, for example. As a preliminary step, the author makes 
a formal statement of the assumption used in making the various estimates— 
assumptions drawn from analyses of the general economic pictures, as well 
as particular assumptions which have to be made about prices of specific 
commodities with which the company has to be concerned. After pre¬ 
liminary mathematical estimates of company and industrial activity have 
been set up, it is very important to refine these estimates by considering 
what deviations from the formula values are likely to obtain. “Guessing” 
what the residuals are going to be makes a high call on the craftsmanship of 
the economists. “The most accurate forecasting will be done by those who 
use judgment as well as mechanical aids in establishing estimated deviations. 
Personal judgment is requited in making a useful estimate of the situation, 
and judgment again is necessary to establish the expected deviations from 
computed values when the correlations are used in forecasting for practical 
purposes.” 

Deviations from the national pattern may be traced to at least four causes 
which the professional forecaster must evaluate carefully: (a) Secular change 
in acceptance of the commodity, (b) Change in customers' inventories, (c) 
Errors due to inaccuracies in the statistics, (d) Inadequacy of the inde¬ 
pendent variable as a measure of the market. 

Among the most interesting pages in the book to a practical forecaster are 
the summaiies or appraisals of economic situations of forecasts prepared for 
Armstrong Cork Company in 1931, 1932, 1936, 1937, and 1938. 
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LETTERS ABOUT BOOKS 


Readers are incited to submit letters about statistical methodology hooks for pMi- 
cation in this forum. Concisst informative letters which supplement prevumsly 
published reviews by pointing out specific strengths^ weaknesses, errors, and errata 
in currently used books are wanted. Criticisms based on actual use of a book as a 
text are especially desired from statistics instructors. Other letters may consist of 
suggestions for the writing of books and reviews. Letters which contain adverse 
criticisms of Journal reviews will be submitted to the author of the review for any 
reply he may care to make. Contributors are rerpiested to avoid personalities. The 
right to decide whether a letter merits publication is reserved. Letters should he sent 
to the review editor, Oscar K. Buros, Rutgers University, New Brunswick, N. J. 


THEORY OF STATISTICAL 
INDUCTION 

P BOFESSOR Kingston has kindly 
called my attention to the fact that 
certain of the errors mentioned in my 
review (this Journal, March 1947) of 
his book (A Teona da Indugdo EstatisH- 
ca, 1945) had, in the meantime, been cor¬ 
rected by the author in Revista Brasi- 
leira da Estatistica (No. 25, p. 56 and 
No. 26, p. 326). A rather complete 
comgenda reached me after the re¬ 
view was submitted for publication. 

I should also like to take this oppor¬ 
tunity to clarify the impression that 
my review has made on some readers. 
The first paragraph was intended to 
make it clear that this book was better 
than some which presumably reputable 
{publishers still add to the large English 
literature. Though I am not well 
acquainted with the field, it seems very 
likely that this is the best and most 
modern book in any of the Latin 
languages. 

John W. Tukby, Assistant Pro¬ 
fessor of Mathematics, Princeton 
University, Princeton, N. J. 

EXAMINATION OF INDUSTRIAL 
MEASUREMENTS 

I N THE June 1947 issue of this Jour¬ 
nal, two reviews of Examination of 
Industrial Measurements (McGraw- 
Hill Book Co., 1946) were presented by 
Professor C. W. Churchman and by 
Mr. Joseph Manuele and Mr. Roscoe 
Borers. The writer wishes to comment 
briefly on these reviews. 

Professor Churchman questions our 
reference to standard "betting odds” 
in limits recommended by the ASTM 
and the ASA. Here we are referring to 


three-sigma limits for indicating as¬ 
signable variations in control charts 
and fitted curves. Professor Church¬ 
man’s statement that "odds differ so 
widely depending on the product” ap¬ 
parently refers to allowable defectives 
or allowable quality variations in 
various products. We hope that he 
docs not infer, for example, that the 
cement industry is more willing to 
take chances with its customers than 
is the textile industry, the steel in¬ 
dustry, or any other group of manu¬ 
facturers. 

Professor Churchman’s comment on 
sample sizes from 300 to 600 failed to 
mention that these were specified for 
fraction defective only. We agree that 
this estimate is "arbitrary” along with 
other such choices made in industrial 
practice. Actually, our recommenda¬ 
tion is based on a statement in L. E. 
Simon’s well-known Engineer's Man¬ 
ual of Statistical Methods, and appro¬ 
priate reference was made in our text. 

We agree that the difference between 
sample estimates and true parameters 
should be emphasized, and our book 
follows the ASTM terminology in this 
respect. However, in industrial work 
we never know "true” values except 
as assumed standards. Formulae re¬ 
lated to the normal curve (or any other 
curve) are of no value to the engineer 
unless he applies them to actual data. 
In our experience, we have encountered 
more dangers due to the presence of 
assignably large variations than those 
due to samfihng error, assuming that 
the sample sizes and procedures recom¬ 
mended for control charts are used. 

Professor Churchman’s review well 
illustrates the difference in viewpoint 
between the statistician and the prac¬ 
ticing engineer. Wartime applications 
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have shown that engineers and others 
of considerably less formal training can 
grasp and apply the "logical basis” of 
industrial statistical methods without 
knowing all of the mathematical 
derivations. The definition of "logical 
basis” is a matter of degree, depend¬ 
ing upon whether such definition is 
that of the mathematician, the statis¬ 
tician, the engineer, or the plant 
foreman. 

Mr. Manuele's review states that 
"neither probability or risk is defined 
or mentioned” in the discussion of 
sampling. The three-sigma limit used 
throughout the book is certainly based 
on a very definite and strong prob¬ 
ability. The engineer who has con¬ 
structed a set of control charts from 
his own industrial measurements will 
thereby become acquainted with the 
sampling risks incurred, assuming that 
he has noted the table of odds listed 
on page 106 of the book. 

Although the writer is more inter¬ 
ested in applying proven methods than 
in developing new ones, our descrip¬ 
tion and use of median-normal limits 
and fraction non-random are, to the 
best of our knowledge, new. We shall be 
inteiested in references showing where 
these have been presented elsewhere. 

The detailed and helpful nature of 
these reviews is greatly appreciated 
by the writer. 

John W. Dudley, Jr., Chemical 
Engineert American Viscose Corp* 
oraiionf 85 South Ninth St,, Phila¬ 
delphia 7, Pa. 

M e. Dudley’s comments on my re¬ 
view point out very well the dif¬ 


ficulties that inevitably arise in the 
application of formalized techniques to 
research and industrial problems. Such 
techniques are only means to certain 
objectives of the experimenter or in¬ 
dustrial engineer, and may become 
very bad means, despite their preci¬ 
sion, in cases where the conditions of 
their application are not satisfied. The 
writer of a practical manual like Mr. 
Dudley’s little volume is faced with 
the two-fold problem of describing a 
precise technique for handling meas¬ 
urements, and at the same time ad¬ 
vising the reader on the restrictions of 
application. No one could hope to 
accomplish this problem completely 
within the scope of the volume, or 
even one many times as large. If a re¬ 
viewer adds a few precautions of his 
own, this is meant to supplement in 
some part the author’s own comments. 
Threc-sigma limits are not always ad¬ 
visable, no fixed sample size is uni¬ 
versally applicable, etc. The reader 
should be made aware as often as 
possible that the choices he makes 
result in operating characteristics, 
which form the real basis for his de¬ 
cisions. And I personally would think 
this was one of the most "practical” 
problems in industrial statistics; its 
solution does not demand a knowl¬ 
edge of mathematical derivations, but 
rather a knowledge of producer and 
consumer risks. A knowledge of con¬ 
sumer risks, at least, does take the in¬ 
dustrial engineer out of his field, but so 
much the better. 

C. West Churchman, Associate 

Professor of Philosophic Wayne 

University, Detroit, Michigan. 
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THE WAR PRODUCTION BOARD’S STATISTICAL 
REPORTING EXPERIENCE 

David Novick,* 

University of Puerto Rico 
AND 

GfiOBGE A. Steiner, t 
University of Illinois 

This is the first of a series of four articles that will present 
an exploratory analysis of the experience of the War Production 
Board and its predecessor agencies in developing and using 
data from industry for policy and administrative purposes 
during World War II. The present article describes the growth 
of the WPB’s reporting structure. 

INTRODUCTION 

I NTELLIGENT Wartime industrial control is impossible without ade¬ 
quate methods for collecting and using facts. In establishing policy 
for control over industrial production and distribution and in adminis¬ 
tering these controls, it is necessary for government continuously to 
make choices among competing demands for resources at all levels of 
economic activity. Facts and factual collection methods are managerial 
tools without which intelligent choices among alternative courses 
of action cannot be made. 

When the occasion arose for the national government to direct the 
industrial machine of this nation in World War II, there existed neither 
the data necessary to support basic policy decisions and administrative 
action nor the technical statistical discipline by means of which the 
data could be produced. The task of industrial mobilization and con¬ 
tinuous control of the American industrial machine during the war 
was unbelievably complex and difficult. But it was a problem im¬ 
measurably complicated by the absence of an efficient emergency in¬ 
dustrial statistical reporting system. 


* Formerly Director, Flogram and Statistics Bureau, War Production Board, 
t Formerly Director, Industnal Statistics Division, Civilian Production Administration. 
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In 1940 and 1941 it became clearer daily to those administering the 
industrial production program that neither the necessary facts nor 
satisfactory methods for getting them were available. Peacetime in¬ 
dustrial statistical series were, for purposes of wartime mobilization, 
incomplete in coverage, not timely, deficient in penetrating detail, 
subject to serious inaccuracies, and imcoordinated. In addition, peace¬ 
time methods of collecting and analyzing industrial series in many 
instances were found inadaptable to the needs of war. 

These deficiencies, although lamentable, are understandable. Peace¬ 
time factual needs, and methods and series to support them, are far 
different from those required for wartime administration. In time of 
peace the economy of this nation is not a centrally directed one but 
rather one in which the totality of economic activity is based upon a 
multitude of individual transactions free of government restrictions 
except those of a general policing character. As a consequence, no need 
exists for the government to collect data which reflect detailed eco¬ 
nomic actions. Rather, government is concerned more with information 
about over-all economic activity and data necessary to carry on specific 
administrative programs of various regulatory commissions and boards. 

This type of data and many of the methods used to collect them are 
not those required by war. In periods of crisis, national policy demands 
that the sum total of business operations accomplish specific and well- 
defined objectives. To meet these objectives the government is forced 
into day-to-day relations with business units. Government adminis¬ 
trators, as a result, need to collect data showing a balanced factual pic¬ 
ture of the industrial machine, in which each detail is related to the 
total and to every other. Business management requires a continuing 
flow of accurate and detailed information to make the kinds of decisions 
needed to conform to government regulations and to effect the maxi¬ 
mum contribution of the concem^s resources to the national objectives. 
These are crisis needs. 

Prior to 1940 the Federal government had two experiences with fac¬ 
tual needs of crisis administration. As part of the World War I program 
of industrial control, the Federal government foimd it necessary to 
introduce a large variety and number of industrial data reporting 
systems. These provided information necessary for the formulation of 
policy decisions and administrative actions relating to the distribution 
and production of specific materials and products. At the end of the 
war practically all of the wartime reporting structure was abandoned, 
and many of the lessons were buried with the dead files of that conflict. 

When the national economic machinery broke down in the 1930’s, the 



wpb’s statistical reporting experience 


203 


government found it necessary to expand its concept of public welfare 
to include both control over and direction of private business so that 
the net effect of business operations would be centered upon economic 
recovery. As an inevitable part of the regulations of such emergency 
agencies as the National Recovery Administration and the Agricultural 
Adjustment Administration, the government again developed a com¬ 
plex industrial statistical reporting system by means of which it could 
support basic policy decisions and administrative actions relating to 
business operations. With the invalidation of the NRA and the AAA, 
the data collections were not continued by either government or in¬ 
dustry and no study of emergency statistical methods was imdertaken. 

As a result, there was not available at the beginning of World War 
II the industrial statistical reporting discipline that emergency admin¬ 
istration was found to require. Starting from this point, industrial 
mobilization administrators began through experimentation, improvi¬ 
sation, and trial-by-error methods the laborious and costly task of 
constructing an industrial statistical reporting framework suitable to 
meet emergency management needs. It was only through bitter and 
time-consuming struggle that the War Production Board eventually 
developed methods and techniques by which its comprehensive report¬ 
ing requirements were reasonably adequate to meet its most urgent 
policy and administrative needs. 

In a matter of days following VE-Day the great majority of the WPB 
reporting instruments were discontinued, as they should have been. 
In peacetime years the government avoids day-to-day operating re¬ 
lationships with business. This also is as it should be. As a result, both 
the wartime statistical series and the methods and techniques developed 
to acquire and use them will tend to disappear. Undoubtedly some of 
the wartime experience will be incorporated into industrial and gov¬ 
ernmental peacetime statistical collections and methods. But the inte¬ 
grated discipline of a satisfactory wartime industrial statistical report¬ 
ing system mil find no peacetime use. 

This inevitable peacetime disuse must not result in unpreparedness 
for a possible future emergency. Adequate preparation must embody 
preservation of the principal lessons of World War II concerning factual 
tools and procedures required to manage successfully the economic 
system. But merely preserving the lessons of World War II ejqjerienoe 
is not enough. Many important tools of administration were still crude 
and blunt at the end of the war despite the whetstone grinding of five 
long war years. In time of peace these tools must be studied and im¬ 
proved so they will be razor-sharp for immediate use in any emergency. 
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Study of the experiences of the WPB in collecting, analyzing, and 
using industrial statistics, if executed at the proper level of professional 
proficiency, can save the nation both millions of dollars and years of 
time in improving its peacetime industrial reporting systems. Perhaps 
of more importance, neglect to take advantage of and build upon this 
experience will more than likely leave the nation, on the impact of any 
future emergency, without the factual tools required to administer 
economic matters. Failure to have available for immediate use in a 
future crisis an adequate reporting system may be catastrophic if the 
military movements in that emergency involve the sudden devastation 
now attributed to atomic bombs, supersonic projectiles, and guided 
missiles. 

The authors have tried to avoid the use of footnotes. The great bulk 
of source material, aside from individual questionnaires, is located in 
personal memorandum of the authors and memorandum of various 
WPB officials. Both sets of documents are generally unavailable. For 
those interested in pursuing further the analysis of methodology evi¬ 
denced in individual questionnaires, attention is drawn to the Catalog 
of War Production Board Reporting and Application FormSj prepared 
by the Industrial Statistics Dhdsion of the Civilian Production Ad¬ 
ministration in 1946. This Catalog, published in ten volumes by the 
U. S. Government Printing Office, reproduces and analyzes approxi¬ 
mately 1,200 of the WPB’s most important questionnaires. 

PART I 

GROWTH OP THE WPB REPORTING STRUCTURE 

To satisfy the needs both of policy-makers and of administrators for 
information to control production and distribution, the WPB created 
and used a reporting structure of unprecedented scope and complexity. 
This reporting structure was not built. It was assembled. It grew out of 
the needs of the moment; it was altered to reflect rapidly shifting ob¬ 
jectives, organization, policy, and personnel; and it was molded from 
the compromise of powerful opposing influences. The problems en¬ 
countered in its development were as complex and broad as the in¬ 
dustrial production system it was designed to harness. No single disci¬ 
pline or procedure ever animated all of its supposedly systematic parts. 
Its techniques were never completely mastered and its potentialities 
never fully realized. At no time did it fully satisfy the fundamental 
needs of policy and administration. 

The purpose of this section of the series is to reduce this vast and 
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intricate agglomeration of fact-gathering devices to rmderstandable 
proportions and by so doing to establish a background which may serve 
for a presentation of selected fxmdamental technical problems and 
lessons coimected with WPB statistical methods. To accomplish this, 
there is presented a broad panoramic sketch of the principal movements 
taken by the reporting structure through the war period. It is an analy¬ 
sis of the struggle to develop a reporting system to produce facts for 
wartime management, the principal problems faced in the struggle, 
and the attributes of the methods introduced to reach the goals set. 

Closing Information^ Gaps in 1940. In the year 1940 the day seemed 
remote when the United States would have to reconstitute its industrial 
system into a single-purpose war-production machine. Defense produc¬ 
tion for our Allies and our own armed forces at first was comfortably 
accomplished within available productive resources. Only gradually 
and in individual areas did it encotmter resistance. As a consequence 
the importance of an adequate statistical discipline to support indus¬ 
trial mobilization management became apparent rather slowly. Actions 
taken to develop a sound statistical order were at first, therefore, 
essentially in the nature of stop-gap measures. 

The first comprehensive factual research by the National Defense 
Advisory Commission was designed to establish broad industrial mo¬ 
bilization policy guides. Efforts were made to develop facts about the 
magnitude of economic resources available for war and nonwar de¬ 
mands, the level at which specific resources might cease to be adequate 
to satisfy potential demand, and the industrial and geographical areas 
in which new plant facilities should be built and located. Existing data 
proved to be inadequate for this work even though completion of the 
studies required only the broadest tjqje of information. 

Failure of this approach led to another which turned out to be even 
less fruitful. Military agencies were urged to develop complete statisti¬ 
cal pictures of their procurement and the claims which they were cur¬ 
rently making and contemplated placing in the future on scarce ma¬ 
terials and products. This, together with a comparable statement of 
nomnilitary demand, it was correctly reasoned, could provide much of 
the basic information needed for broad industrial mobilization policy 
decisions. By the time useful action was taken in developing such over¬ 
all statements of claiuds on industiy, the United States was already 
engaged in war! 

The earliest demand for detailed industrial data centered upon plant 
operations and capacity. Recognizmg the paucity of figures in govern¬ 
ment about such matters a twofold effort was made to get facts from 
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industry. First, various State Industrial Councils obtained information 
on plant capacities and equipment within their states. Second, the 
Bureau of Labor Statistics was asked to develop a repoiii on individual 
plant operations by which the NDAC could learn the extent to which 
plants were actually operating 24 hours a day, seven da3's a week. The 
State Councils accumulated a series of reports and submitted them to 
the NDAC but the results were inadequate for solving the statistical 
problems of that agency. The Bureau of Labor Statistics developed a 
questionnaire which requested monthly reporting from about 300 plants 
in the metal-working trades concerning man-hours worked, labor shifts, 
and weekend emplosnnent. Both programs feU far short of the factual 
requirements of the times. 

Hindsight reveals quite clearly that such amateurish methods could 
not provide the information required for a war. There was, however, 
no dearth of suggestions for getting the information needed. In the 
spring of 1940 and continuing into the autumn, for example, there was 
considerable discussion in the NDAC of the desirability of collecting 
from industry highly detailed reports as a part of what was vaguely 
referred to as “a modified German Plan.” It was assumed that the 
Germans, who were much deeper in war than the United States, had 
developed a machine by machine, man by man, pound of material by 
poimd of material, weekly or daily reporting system which by some 
t3q)e of tabulating magic was quickly converted into both an all-out 
summary for policj'-making and a completely detailed report for ad¬ 
ministrative decisions. The apparent success of German industrial 
mobilization led to the idea tW the United States should have the 
same sort of statistical system. Introduction of this type of reporting 
system, how'ever, inevitably would have been labeled a “bureaucratic 
deluge of paperwork” at this stage of the war effort. For this and other 
reasons the idea was abandoned. 

Indmdiial Preference Ratings. The search for information could not 
be allowed to delay the solution of current problems concerning acqui¬ 
sition of scarce materials needed to produce vital defense equipment. To 
meet these issues, the Army and Navy Munitions Board, with but lim¬ 
ited legal and executive authority, established in August 1940 a pri¬ 
ority plan based on preference-rating certificates. By means of these 
documents (PD-1,2,3,4 and 5) it became possible for producers hold¬ 
ing government contracts for defense equipment to apply for, receive, 
and extend preferred treatment to items they and' their suppliers 
needed to fulfill their contracts. A Priorities CriUcoA List was established 
to limit the materials and products to which preference could be ap- 



wpb’s statistical reporting experience 


207 


plied. The listing was relatively short and specified items in broad, 
general terms, such as: barometers, bombs, cranes, explosives, chromi¬ 
um alloy steel, aluminum and aluminum alloys, and tin. 

With a broadening and acceleration of defense production and ex¬ 
pansion in the materials and products for which preference was applica¬ 
ble, this system operated less effectively with each passing day. So long 
as sufficient materials and productive capacity existed to support both 
defense and unrestricted civilian production the system was effective 
in assuring to manufacturers with government contracts the materials 
they needed for production. But, as defense production mounted, it 
became increasingly difficult to control preference ratings so as to hold 
preference-rated demand for critical materials within available supply. 
The inevitable result was the rise of log jams among competing au¬ 
thorized demands in more and more products and materials. 

These log jams could not be broken by the individual preference¬ 
rating system. The system contained no automatic adjudication of 
competing claims at any level of manufacture when supplies of mate¬ 
rials and products were insufficient to fill all rated demands. In addi¬ 
tion, the rating certificates could not be used to produce the facts 
government could use to resolve the problems by administrative action. 
Not only was there little uniformity in data incorporated in rating cer¬ 
tificates, but the flow of paper reached such proportions that it was 
impossible to use the data for comprehensive policy or broad admini- 
trative action. 

Breaking the Log Jams. The first efforts to break these log jams fol¬ 
lowed admonitory rather than mandatory patterns. Policy decisions 
made on the basis of whatever data were available were translated into 
detailed regulations and control orders for which there was no imple¬ 
mentation in the form of reports transmitted between government and 
industry. It was only a matter of months, however, before problems in 
connection with such tight items as nickel-bearing steel, magnesium, 
aluminum, machine tools, and heavy industrial equipment became so 
acute that the inadequacy of generalized self-administered controls to 
effect the kind of production and distribution control required by the 
situation was generally recognized. Experience demonstrated that only 
the most detailed type of administration and centralized control could 
be expected to provide the desired results. 

In the autumn of 1940, the need for new detailed information to 
solve critical problems connected with individual products and mate¬ 
rials had become so urgent that a fresh approach to the acquisition of 
data was of paramount importance. Each of the Divisions of the 
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NDAC,^ and later the Office of Production Management, charged with 
responsibility for control of a particular material or product worked 
out new and more detailed programs for the collection of data required 
for daily administration of production and distribution controls. Those 
responsible for magnesium, for example, sought data about all the 
direct and indirect economic problems involved in the production and 
use of magnesium. The same pattern was followed by those responsible 
for nickel, alloy steel, machine tools, and other items in short supply. 
Centralized control over questionnaire issuance fortunately had been 
introduced in the early days of the NDAC and continued into the 
Office of Production Management. As a consequence, surveys actually 
introduced as a result of these programs were limited to those most 
obviously needed to solve an immediate problem. The country was 
thus spared the painful experience of innumerable data requests which 
inevitably would have overlapped, duplicated, and in other ways inter¬ 
fered with one another, not only in the data collection itself but also 
in the analysis and administration of the results. A trend was begun, 
however, to acquire more and more data about individual products and 
materials, each with little reference to the other. 

In March 1941 a more realistic approach was made by the recently 
formed 0PM (Office of Production Management). This involved modi¬ 
fication of the preference-rating system by the introduction of P, or 
"blanket preference rating,” orders, providing preference-rating assist¬ 
ance to classes of products; and M and E orders, directing the distribu¬ 
tion of basic raw materials and equipment. In September 1941L orders 
were introduced to facilitate operation of M orders by limiting non- 
essential demand for materials under M-order control. The great 
bulk of information pertaining to individual products and materials 
which was acquired throughout 1941 accompanied administration of 
these orders. 

P Orders. P orders provided for the assignment of ratings to classes 
of products, usually at the final product end of the industrial system. 
One type of P order permitted automatic assignment of ratings for 
procurement of material needed in the production of the classes of end 
products specified in the order. Such were the first orders, namely, P-1 
applicable to electric traveling cranes, P-2 applyiag to machine tools, 
P-3 for airframes, and P-4 for airplane engines and propellers. Each 


1 Throughout the text all subdiyisioxis of various agencies are specified as Divisions, for purposes 
of clarity, except in a few instances in which qpotations from orders incorporate the official title of a 
subdivwn which is other than a division. 
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order, with few exceptions, specified a reporting form upon which pro¬ 
ducers to whom orders were issued had to report periodically to the 
0PM. In addition, producers using or receiving preference treatment 
tmder an order were required to maintain records as specified in the 
order. This provision, however, was often vague and carried no compul¬ 
sion of compliance. This type of P order, on the whole, was more succes- 
ful in relieving a tottering preference-rating system than in developing 
satisfactory new quantitative measuring tools and methods for using 
them. 

The Defense Supplies Rating Plan, issued under order P-6 in May 
1941 was a different type of order and its methods were destined to 
become of vast importance in WPB operations. Certain manufacturers, 
such as those engaged in producing electrical motors, twist driQs, and 
mill supplies, received defense orders indirectly and on shorter delivery 
cycles than their manufacturing time. Deliveries customarily were 
made from stock, and replenishment of scarce materials for production 
of additional stock could not be made by extensions of preference cer¬ 
tificates for the small amounts of separate materials going into each 
individual motor or twist drill. The application of individual prefer¬ 
ence ratings to these producers was quite unrelated to the manufactur¬ 
ing procedures of their products. It was the purpose of P-6 to facilitate 
production for such manufacturers. 

Each manufacturer applying for aid under the order presented to the 
0PM, on form PD-25, the following information for his plant or sub¬ 
division thereof; past quarterly material consumption, material stocks 
on hand, and cuirent quarterly requirements for those scarce materials 
for which assistance in acquisition was requested; the amount of de¬ 
fense business; and the specific finished products for which scarce 
materials were needed. This plant questionnaire embodied rudimentary 
methods from which profitable development was to be made. 

Principles originated by the Defense Supplies Rating Plan were used 
again in August to aid manufacturers of various health supplies. Order 
P-29, Health Supplies Rating Plan, was issued to aid manufacturers of 
specified health equipment who were encountering difficulty in obtain¬ 
ing ratings sufficiently high to procure basic materials needed in their 
manufacture. Fonn PD-79, used to administer the plan, required from 
manufacturers plant reports which requested data about finished 
products to be made from scarce materials, quarterly consumption of 
and requirements for scarce materials, inventories of scarce materials, 
and withdrawals from stocks. With these data, as with the information 
acquired under the Defense Supplies Rating Plan, the 0PM was in a 
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position to appraise the aggregate needs of each plant and to issue 
ratings accordingly. 

To broaden the scope of the Defense Supplies Rating Plan, order 
P-90 was issued in December 1941 to launch the (voluntary) Produc¬ 
tion Requirements Plan. Its application form, PD-25A, represented a 
more advanced development of both PD-25 and PD-79. But data 
requested on PD-25A were also designed to enable a manufacturer to 
present a complete picture of his operations in relation to the war pro¬ 
gram and to essential civilian needs, and to state his production-mate¬ 
rial requirements for these purposes. So successful was the operation of 
this plan that by early 1942 approximately 7,000 plants were obtaining 
priority assistance under it. 

These plans operated on the basis of an individual manufacturing 
unit. They recognized the fact that industry was made up of segments 
of companies and that producers were usually engaged in the manu¬ 
facture of more than one class of products. The plans, therefore, were 
designed to provide priority assistance on a plant basis to facilitate 
procurement for a plant. Methods designed to acquire facts for these 
purposes were important in paving the way for future comprehensive 
plant reports. 

Rise of Individual Maierial Allocation Systems. In March 1941 the 
same month P-1 was issued, the first M order appeared “to conserve 
the supply and direct the distribution of aluminum.” It was followed 
in succession by M orders for magnesium, ferro-tungsten, tungsten 
metal powder, neoprene, nickel-bearing steel, and numerous other criti¬ 
cal basic materials. (E orders directed distribution of finished products, 
such as machine tools.) These orders embodied the idea that control 
of the distribution of basic materials through the industrial system held 
the key to maximum war production. To administer a system incor¬ 
porating this idea, producers of basic materials were required to file 
reports, usually monthly, requesting authorization to ship mill shapes 
and forms to customers. These reports requested detailed data on 
material shape and form production, orders by individual customers, 
and the products for which materials were ultimately to be used. 
Upon the basis of this information, administrators responsible for ma¬ 
terials under control distributed available supplies among competing 
programs and products. 

The dominant characteristic of control policy and industrial report¬ 
ing in 1941 centered about hundreds of little imiverses circumscribed 
by these orders. Each system in use followed a rough pattern but so 
differed in details from other systems that uniform control and statis¬ 
tical reporting were completely absent. 
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The steel plate allocation system represents a reporting system de¬ 
signed to control the flow of semi-finished products through a fabricat¬ 
ing bottleneck. As eventually developed, the reporting system re¬ 
quired each consumer of steel plate to list his requirements with his 
supplier one month in advance, specifying government contract num¬ 
ber, the use of the material, the preference rating, and the order 
number. A copy of this form (PD-298) was transmitted to the Steel 
Division, together with a statement of steel plate consumption and in¬ 
ventory (form PD-299). The steel plate producers, in turn, listed and 
sent to the Steel Division (form PD-169) all orders received and speci¬ 
fied those which were included in the rolling schedule for the following 
month. The Steel Division surveyed these orders and rolling schedules 
to determine whether any important military requirements had been 
omitted. In addition, an effort was made to review the schedules to 
msure the full utilization of the productive capacity of the steel plate 
mills. Reporting and analysis were centered only upon the needs of the 
Steel Division to make its decisions. These decisions were made in the 
absence of complete information about the status of aluminum, copper, 
and other materials, competing requirements in other areas, component 
schedules, assembly schedules, etc. 

Most individual allocation systems operated on the basis of detailed 
reports from industry. The typical material report form, or application 
for authority to ship raw materials, requested all or part of the follow¬ 
ing type of information from a material producer, on a monthly basis: 
(a) the part into which the material was to be incorporated; (b) the 
product into which the part was eventually to be assembled; (c) the 
use to be made of the product; (d) the government contract number, if 
any, of the finished product; and (e) other information, such as prefer¬ 
ence rating and end-use pattern. Producers generally submitted this 
information on a series of forms. All orders provided for a report, some 
for more than one. In addition to producer reporting, allocation systems 
frequently provided for reports from consumers interested in acquiring 
raw materials under control, warehouses stocking materials, dealers, 
and others through whom the material passed in its metamorphosis 
from crude form to final product. 

Among all the competing allocation systems there ran the thread of 
end-use. Most producers' reports requested data on the use which in¬ 
dustry expected to make of raw materials, e.g., carbon steel bar to be 
made into a valve stem for a specific type of valve for a destroyer. Anal¬ 
ysis of various forms created for the most important critical materials 
shows that there were as many different end-use reporting require¬ 
ments as forms. Each differed from the other. The fact of nonintegra- 
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tion and nonuniformity of end-use requests was negligible beside in¬ 
dustry’s inability to supply the desired data. Most of the information 
requested was known only to the assembler of the finished end product, 
and only a well-established end-use reporting mechanism could trans¬ 
mit the information through the contractual chain. But no such trans¬ 
mission mechanism existed. 

Each allocation system, operated autonomously by each Division, 
resulted in unbalanced flows of materials. Quantities and delivery dates 
authorized by the Steel Division might not bear any relationship to 
those authorized by the Copper Division, because neither the mills 
nor the Divisions consulted one another about the coordinated needs 
of the purchaser. Nor were they in any position to do so. Each group 
responsible for a material or product attempted to appraise the needs 
for its controlled item alone and to direct the use of its supplies through 
the industrial system in terms of what were considered to be priority 
needs, with little reference to operations among other allocation sys¬ 
tems. 

Although M orders provided better control over the distribution of 
scarce raw materials than preference ratings alone, they did not provide 
the balance that only a comprehensive over-all reporting system could 
achieve. Each group responsible for administration of an M order gen¬ 
erally was willing to acknowledge the need and advantage of an aggre¬ 
gate statement of demand and supply covering all materials and prod¬ 
ucts. But when this intellectual willingness was pressed with the sug¬ 
gestion that individual controls and specialized allocation systems 
should be abandoned in favor of aggregate reporting and over-all allo¬ 
cations, the bridge could not be crossed. The proposal was sacrificed 
to personal desires and individual notions of how control should be 
exercised. No support for over-all comprehensive reporting and alloca¬ 
tions was given by those who were skilled in their specialized fields and 
who had erected their own control structures. Indeed, to a very large 
extent any coordinated effort to develop comprehensive controls was 
quickly torpedoed by the managers of the individual schemes, who 
saw in aggregate reports a diminution in their individual importance 
as controllers of segments of the economy. The maintenance of each 
individual control, they of course felt, was “absolutely essential to 
maximum production.” 

These personal feelings cannot be construed as indictments of the 
patriotism of these men. On the contrary, these administrators were 
extremely patriotic in much of their desire to cling to the control sys¬ 
tems they had established. They knew their materials and they knew 
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the industries through which they moved. They were afraid that per¬ 
sons with less specialized knowledge might jeopardize the national in¬ 
terest not by malice but by ignorance. 

Increasing difficulties in hammering down urgent demands for criti¬ 
cal materials to match supplies led, in September 1941, to the intro¬ 
duction of L orders. L orders were basically designed to restrict produc¬ 
tion of certain classes of products to a stated quantity in a given 
period of time and to limit or prohibit the use of certain specffied mate¬ 
rials or products in the production of other products. They were pre¬ 
eminently a device by which less essential demand for products and 
materials under M-order control might be easily eliminated. L orders 
generally provided for industrial reporting, and as a result were fruit¬ 
ful sources of information about the products and materials to which 
they applied. Issued in limited numbers in the fall of 1941 (there were 
26 issued up to January 1, 1942) they increased rapidly after the end 
of the year and by June 1942 numbered 154. 

Reporting instruments accompanying L orders varied considerably 
since the specific provisions of different orders pertained to specialized 
problems of product and material limitations. Some orders required no 
reporting or applications; others used but one instrument; and some 
used a number of reporting forms. L-117 which limited production and 
distribution of heavy power and steam equipment, used (in August 
1942) ten different statistical and application forms; L-41, pertaining 
to construction, utilized eleven reporting instruments. Through such 
reporting instruments the WPB was able to acquire considerable de¬ 
tailed information about the production, inventories, distribution, and 
use of products and materials covered by L orders. 

Comprehensive Inventory Surveys. The only industry-wide report in¬ 
troduced in 1941 grew out of problems connected with inventory con¬ 
trol. Early in the national defense program, manufacturers were im¬ 
pelled by every sound peacetime operating criterion to accumulate in¬ 
ventories of basic materials. By 1941, a point was reached at which a 
balance had to be struck between increasing defense materials require¬ 
ments on the one hand and efficient utilization of tightening material 
supplies on the other. The imcomfortable spread between material sup¬ 
plies and defense requirements for them, already existing in the case 
of a few basic materials, threatened to expand rapidly to many mate¬ 
rials. Facts about use and consumption of inventories, beyond data ob¬ 
tained through operation of M orders, were essential to permit govern¬ 
ment to flush out idle stocks to help restore balance in the materials 
demand-supply equation. 
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Combating this situation demanded facts—^facts collected on a com¬ 
prehensive, industry-wide basis. But again, half measures were tried 
first. In the begiiming, the 0PM limped along with spot inventory sur- 
vejrs of a few tight individual materials and the broad data available 
in the records of permanent government agencies. None of these sur¬ 
veys, however, provided systematic coverage of inventories from pri¬ 
mary producers through various processing stages to final fabrication. 
But the data at hand, piecemeal and fragmentary though they were, 
showed that inventory accumulation was reaching serious proportions. 
There was little in them, however, to support specific policy decisions 
and administrative action to cope with the situation. 

The first effort to solve the problem, through General Materials 
Order No. 1 issued in May 1941, was completely ineffective. Under the 
provisions of this order, which covered 29 materials, users of the mate¬ 
rials specified filed a monthly statement (PD-19A) that they were not 
increasing inventories, and producers filed statements (PD-19B) that 
they were not shipping to idle inventories. The deficiencies of this order 
quickly became apparent, and other measures had to be taken to make 
operative the ideas embodied in it. 

To establish a factual basis upon which General Metals Order No. 1 
could be made effective, form OPM-69 (inventory and consumption of 
scarce materials) was sent in August 1941 to all manufacturers who 
were holders or users of any of the 70 scarce metals enumerated in the 
form. A similar form, OPM-75, was tailored to fit the needs of all dealers 
in and distributors of scarce metals. In addition, comparable informa¬ 
tion was required on forms OPM-132 and OPM-133 from financial 
houses, public utilities, and other groups who held these metals. OPM- 
69 was mailed to about 70,000 manufacturers; the mailing list for the 
other related forms aggregated approximately 30,000 names. This was 
the first industry-wide reporting effort in the war period. 

Those to whom the forms were sent were asked to report the quan¬ 
tity of each metal which they had used in August 1941 and the amount 
on hand at the end of that month. Definitions of metals for which data 
were requested were broad and extended to the point where the raw 
materials lost th^ identity as raw materials and became fabricated 
products. Fabricated product definitions were also vague and inade¬ 
quate. 

The immediate results of OPM-69 and its related forms were rather 
disappointing, largely because of inexperience in the development of the 
form itself and problems connected with its tabulation. Tabulated sum¬ 
maries of OPM-69, as a result, were delayed imtil December 1941. In 
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the meantime events had moved so rapidly that the data reported on 
OPM-69 and its related forms were considered obsolete by the policy¬ 
makers and administrators for whom they had been collected. Not only 
had difficulties encountered in production moved far beyond the point 
where inventory accumulation problems were of dominant concern, 
but the Divisions of 0PM had collected inventory and other production 
information for many individual products and materials on a current 
basis. Although the OPM-69 data soon lost currency, they contained 
much information which proved to be useful for a long time to come. 

The methods of OPM-69 and its related forms, however, were signif¬ 
icant at this stage of the war effort. Of most importance, the issuance of 
these forms broke the ice, so to speak, in the area of comprehensive 
reports coimected with metal usage and consumption. In addition, 
much was learned about such technical problems as definitions, units 
of measure, instructions, mailing lists, and tabulations as they related 
to industry-wide reporting. Equally as important as the positive meth¬ 
odological lessons were the negative ones which taught what not to do. 

TabulaMng Preference Raiing CertificcAes. Urgent needs for compre¬ 
hensive information about many aspects of industrial production cen¬ 
tered attention upon the aggregation of data contained in individual 
preference-rating certificates and resulted in a tabulating effort more 
ambitious than fruitful. These certificates contained data on materials 
used in production, manufacturer shipments of defense products, pref¬ 
erence ratings, orders, and other sorely needed facts. For a considerable 
period of time the small volume of these certificates, centrally accumu¬ 
lated in the 0PM, was hand-tabulated with some success, but little or 
nothing was done with those received by the Army and the Navy. As 
the fiow of paper mounted, the Bureau of the Census was asked, in 
August 1941, to tabulate data by mechanical pimch card methods. 
Thereafter, the flow of paper reached flood proportions and as a result 
tabulation difficulties grew faster than solutions. Completed sum¬ 
maries were never made and the project was abandoned in January 
1942. 

The Reporting System at the Time of Actual War. The net effect of all 
these developments was to leave the United States upon the eve of its 
active entrance into World War II without the basic statistical-ad¬ 
ministrative tools needed to cope with pressing production and dis¬ 
tribution problems. The core of the most serious problems was demand- 
supply imbalance for metals. Throughout the fall of 1941 it was be¬ 
coming more apparent daily that the supply of a number of basic 
materials, particularly metals, was completely inadequate to satisfy 
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all top priority demands; far too many instances had appeared when 
even the demands of highest-rated military orders could not be filled 
from the limited material supplies made available for these demands. 
The fact w^as assuming crisis proportions that shortages of vital raw 
materials, particularly the basic metals, and inequities in the distribu¬ 
tion of these materials under the operation of individual allocation and 
preference rating systems, were throttling the efl&cient use of the 
American productive machine. Nothing short of carefully controlled 
over-all distribution of scarce materials seemed adequate to secure 
maximum utilization of facilities and full production of essential 
goods. But this necessitated considerable knowledge about both the 
demand for and the supply of scarce materials. Neither the tools for 
acquiring this information, nor those needed to use the information to 
balance the demand-supply equation existed when this country entered 
the war. 

Notwithstanding the efforts of 1941, available facts about neither 
side of the demand-supply equation could provide a satisfactory basis 
for working a balance. Military requirements were unrealistic, inac¬ 
curate, and unbalanced within programs; they were more in the realm 
of "desires” than of “requirements.” The 0PM, and later the WPB, 
had progressed little further in developing comparable essential civilian 
requirements. Civilian programs were disorganized in the aggregate, 
in detail, and as related to military programs. Available estimates of 
both were far too high in totality, subject to rapid and imexpected 
shifts among programs within over-all totals, unadjusted for lead times 
and contingency allowances, and constantly in the process of changing 
specification and design. Uncertainties in the measurement of material 
supplies also created a stumbling block in equating the supply-demand 
balance. In some cases, calculations of both long-run and short-run 
estimates of material supply were so poor that later acquisition of ac¬ 
curate figures showed them to have missed by as much as 50 per cent. 
Overestimates and underestimates were of about equal frequency 
among various individual calculations. Thus, neither demands nor sup¬ 
plies in aggregate terms were accurate enough for intelligent policy 
and administrative action. 

Inaccuracy, bias, and absence of facts as to both the demand and the 
supply sides of the raw material equation met head on in early 1942 
with completely inept procedures for reconciliation of balance, even 
had there been sound requirement and supply figures. The WPB had 
not perfected within its own organization a systematic and effective 
review of the statistical, technical, and conceptual accuracy of even the 
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poor estimates of demand and supply which were available. There was 
no administrative machinery for equating demand and supply aggre¬ 
gates at policy levels. There were only the unsynchronized and crude 
procedures of M orders. Machinery was absent for scheduling raw 
materials and components through the fabricating chain with some 
semblance of coordination. As a result, each procurement agency, al¬ 
most with impunity, heavily mortgaged critical materials, machine 
tools, labor, and other resources needed to protect its over-optimistic 
schedules. In the process no reference was made to competing claims of 
others. The WPB was helpless to reconcile and control these powerful 
influences, which worked towards imbalances throughout the nation's 
industrial productive machinery. 

After more than tAvo years' experience with wartime production con¬ 
trol, facts simply were not available for the most fundamental type of 
policy decision. This situation was epitomized in the character of policy 
action taken by the WPB Requirements Committee which, in early 
1942, knowingly issued its own over-all allocation directives with full 
consciousness that no mechanism existed by which it could determine 
whether its decisions were in full conformance with the over-all in¬ 
dustrial situation; by which it could relate its actions to those taken in 
connection with individual materials or products, either by itself or by 
product administrators; and by which it could be sure that administra¬ 
tion and execution of its actions was adequate for the fulfilment of its 
decisions. So explosive were pressures at the time that policy decisions 
had to be made with or without facts on which to base them. 

These conditions led to the fibrst concrete attempts made during the 
war period to obtain comprehensive information by which intelligent 
over-all metal-allocation policy decisions could be made. Throughout 
1941, and continuing into 1942, substantial volumes of information 
were flowing to administrators of individual products as a result of the 
operation of P, M, and L orders. To coordinate information about these 
individual products and materials and to acquire comprehensive in¬ 
formation by which war industrial activity could be measured and the 
demand-supply equation balanced, two industry-wide questionnaires 
were issued. The first was form WPB-732, which was designed to ob¬ 
tain comprehensive information about operations in the metal-workmg 
trades. The second was PD-276, which was designed to acquire plant- 
by-plant data concerning the use and requirements of critical metals. 
At the same time renewed pressures were exerted to obtain adequate 
requirements data from procurement agencies and to accumulate in¬ 
formation on essential civilian demands within the WPB itself. These 
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measures stimulated a chain of events which eventually resulted in the 
accumulation of sufdcient data by which policy and administrative 
actions were reasonably well supported and coordinated for the first 
time during the war period. 

WPB-7S2, Form WPB-732 traced its ancestry back to the special 
plant reports collected by the Bureau of Labor Statistics for the NDAC. 
Upon the entrance of the United States into the war, it was recognized 
that a central war-control agency could ill afford to be without compre¬ 
hensive knowledge relating to emplo 3 mient, machine and plant utiliza¬ 
tion, and other comparable data concerning all important industrial 
segments. By this time, too, it was clear that the main focus of the 
WPB control had to be centered upon the metal-fabricating trades. 
As a result, WPB-732 was inauguiated in March 1942 to obtain such 
facts from all important manufacturing plants. 

WPB-732 was an operations-type statistical report submitted on a 
comprehensive scale. The original mailing list totaled about 15,000 
plants in the metal-working trades but quickly rose to around 25,000. 
Towards the end of the war the list fell to approximately 9,000 of the 
largest plants, whose operations constituted about 85 per cent of total 
dollar shipments in metal fabricating industries. Information was 
requested concerning the extent to which plants had converted to war 
work, the extent to which plants were operating at capacity, products 
manufactured, unfilled orders, employment, machinery utilization, and 
explanations of failure to keep abreast of war contract delivery 
schedules. 

Submitted on a monthly basis, its primary fimction was to produce a 
broad measure of the trends of war production and to reveal, for 
policy-making purposes, potential areas of production and distribution 
difiBiculty. So useful was the information collected from this form and so 
flexible was its method in adjusting to changing needs of management 
and to varying practices in industry, that it was destined to outlive the 
WPB. Throughout its existence, this form underwent constant revision 
to reflect changes in the preference-rating structure, the introduction 
of the Production Requirements Plan, and the Controlled Materials 
Plan; to include new types of data; and to adjust its informational re¬ 
quests to peculiarities of various industries. Throughout all these 
changes, however, the WPB maintained statistical threads in this 
questionnaire by which a continuous long-thne historical series of com¬ 
parable and related data could be maintained. This was the only 
comprehensive reporting instrument of the WPB to embody this fea¬ 
ture. 
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The data collected were at all times valuable guides to policy on, and 
administration of, the armament production program. As a statistical 
rather than an application-type report, however, the figures were de¬ 
signed for policy rather than administration and did not constitute a 
vehicle for direct implementation of policy. But the figures revealed 
how policy and administrative action were working out in industry; 
served to detect potential problem areas; measured utilization of man¬ 
power and machinery in war work; and provided the basis for hundreds 
of special statistical reports upon which important decisions were 
made. Altogether, the data were used to throw light on a wide variety 
of problems. 

PD-275. The introduction of form PD-275 in February 1942 firmly 
grappled with the materials-control problem facing the WPB. Experi¬ 
ences in late 1941 left no doubt that the prospective drain of an all-out 
war effort on the country’s raw material supplies would become so 
great that only a coordinated materials-control plan would avoid in¬ 
dustrial chaos. It was vital for the WPB to know over-all total require¬ 
ments for basic materials and the supplies available to satisfy them. 
These requirements had to be classified according to major uses and 
time periods, and raw materials had to be scheduled through the in¬ 
dustrial system on the basis of some sound order. A major contribution 
in the acquisition of the facts needed and the methods by which they 
could be used to exercise control Tvas made by form PD-276. 

PD-275 was essentially a modification of PD-25A currently being 
used under the (volxmtary) Production Requirements Plan. Indeed, the 
form was so prepared that information requested on it was the same as 
the data asked for under the voluntary PRP. There were two major 
improvements, however. First, it was sent on a mandatory basis to 
11,000 manufacturing establishments whose operations, it was calcu¬ 
lated, would require approximately 90 per cent of the metals used in 
manufacturing. Second, a list of about 110 metal shapes and forms was 
devised from those under the PRP and preprinted on the form itself. 
The first change assured coverage; the second assured speed in tabula¬ 
tion. For each of the metals, the form requested data on inventories 
and consumption in the fourth quarter of 1941 and anticipated require¬ 
ments in the second quarter of 1942. Manufacturers were asked to sub¬ 
mit a separate form for each class of products manufactured if existing 
inventory and other records were such that separation was possible. 
This technique facilitated the reporting of data about each product. 

The technical results of the survey were heartening. Summary 
tabulations provided the first reasonably accurate over-all measure of 
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metal consumption and requirements by manufacturing areas. The data 
revealed large lumps of critical metals lodging in excessive inventories, 
metal shipments on unrated orders, and metal demands carrying the 
highest ratings in excess of visible supplies. In other respects the sum¬ 
maries threw into sharp relief the fact that, if urgent war and essential 
civilian production was to be met, tighter and more efficient control 
over the flow of metals was necessary. 

The WPB, however, was not quite ready for extended mandatory 
metals control on a comprehensive basis. A second PD-275 was there¬ 
fore issued in April, This form was sent to 25,000 of the largest metal¬ 
using manufacturers and to shipyards, arsenals, railroads, and other 
consumers of metal. In layout, the form was similar to the previous 
issue, with one important modification. The form provided space m 
which manufacturers were required to report metal use and require¬ 
ments for each class of products. Not only was the coverage of this form 
better than that of the first but the data obtained were more accurate 
and more readily tabulated. 

The results of the tabulation of this survey reinforced those of the 
first questionnaire. Both made one fact stand out. The productive 
capacity of the United States was well beyond its ability to supply crit¬ 
ical materials to industry, and, as a result, expansion of the production 
of the most vital military and civilian goods depended upon the im¬ 
mediate introduction of comprehensive coordinated control over the 
flow of basic metals. In addition, both forms, together with techniques 
built up through the operation of the (voluntary) Production Require¬ 
ments Plan, revealed the administrative and statistical methods by 
which this could be accomplished. 

The Production Requirements Plan. The WPB took the obvious step 
in the introduction of the (mandatory) Production Requirements 
Plan on Jime 10, 1942. Under this plan any plant, or division thereof, 
whose past or anticipated quarterly use or requirements of critical 
metals exceeded $5,000 was required to apply for rating assistance. The 
application form was PD-26A, a modification of the original PD-25A 
only in its incorporation of various technical lessons provided by 
PD-275. Essentially these were the preprinted metals stub and the 
direct relationship of metal use and requirements to individual final 
products. Exceptions to filing this application form were granted to 
United States or other government agencies (not including shipyards, 
arsenals, factories, etc., which were subject to the covering Regula¬ 
tion), transportation companies, public utilities, and so on. In practice, 
however, data were gathered by other means for these so-called 
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“exempt groups” so that a complete statement of over-all metal use 
became possible. 

Each plant falling within the above requirements was required to 
submit data, on PD-25A, on the basis of the smallest breakdown of 
operations which existing raw material inventory records permitted. 
The calendar quarter was the time unit and for this time period manu¬ 
facturers reported: (1) the dollar value of past and anticipated ship¬ 
ments for each final product, analyzed by preference ratings; (2) past 
consumption and future requirements for each of about 200 specified 
metal shapes and forms, for all operations of the plant; (3) past and 
anticipated inventory levels for each of the 200 metals; (4) past con¬ 
sumption and future requirements for each of 200 specified metal 
shapes and forms, for each final product; and (5) past consumption and 
future requirements for important components. Other information in¬ 
cluded rated orders, metal requirements for long periods in the future, 
and analysis of ultimate purchasers of products. 

Tho first PD-25A, covering operations in the second quarter and re¬ 
quirements for the fourth quarter of 1942, was sent to approximately 
32,000 plants. Successive quarterly operations extending through the 
second quarter of 1943 permitted a stabilization of the mailing list at 
about 30,000 plants or plant divisions. Of this number approximately 
15,000 applications accounted for practically all metal consumption in 
manufacturing. 

PD-25A fully covered industrial plant operations as they related to 
the consumption and use of metals. With this information, competition 
for basic metals in each plant could be appraised in terms of both mill 
supplies and the relative importance of end products. The entire struc¬ 
ture of input-output relationships in American industry for the first 
time was silhouetted for study, analysis, and application to the prob¬ 
lems of directing tho flow of materials through industry. Further, 
mere addition of figures from each plant revealed reasonably accurate 
actual metal usage and potential requirements which, when related to 
calculated supplies and military and civilian programs, provided the 
means for the balanced allocation of materials in aggregates and in 
details by industries, plants, and products. 

PD-25A approached more closely than any previous effort the sort 
of comprehensive reporting methodology needed in wartime. Tabula¬ 
tion of data on PD-25A put the Requirements Committee of the WPB 
in a position to pare total military and civilian requirements to match 
critical material supplies. When this was done, reduction in authoriza¬ 
tions to buy and use metal was made by WPB administrative divisions 
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among various industry groups on a plant-by-plant basis. These deci¬ 
sions were transmitted to manufacturers by authorizations contained 
on their PD-25A application. PD-25A ,therefore, not only supported 
the highest policy decisions concerning metal allocations but also func¬ 
tioned as an instrument of administrative action. 

Despite the potential usefulness of the PRP as an over-all control 
system, neither the preference-rating system nor individual allocation 
devices used by the WPB vrere eliminated by the adoption of the PRP. 
The effectiveness of the PRP approach to equate demand-supply 
imbalances And to produce order in the flow of materials was seriously 
impaired for these and other reasons noted below. The development of 
this new technique was accompanied by the adaptation and partial 
subordination of old methods rather than their complete abandonment. 
(This tendency toward superimposition of new controls on old ones 
was prevalent in the WPB’s operations and accounts for much of the 
complexity and duplication which existed in the statistical structure.) 
In addition, no great progress was made in improving the techniques 
for calculating basic material supplies, in coming to grips with the 
problem of military and civilian requirements at the top procurement 
levels, or in controlling contract issuance. 

Materials Accounting Reports, The effort to obtain a comprehensive 
system to produce facts to control metal flows brought with it a con¬ 
cept of basic material accountability. It was early recognized that 
accountability for action was an essential part of administrative order: 
it was one problem to obtain information upon which to determine basic 
policy and internal administrative actions to carry out policy, and quite 
another problem to be certain that internal operations and action at the 
production front were in conformance with policy. The latter assurance 
could come only in a system where current records of action were kept. 

Toward the end of 1941 and early in 1942, efforts were made to con¬ 
trol by accounting methods the mortgages placed on scarce metal sup¬ 
plies. They were not too successful. The adoption of the PRP, however, 
required the establishment of internal record-keeping systems by which 
current statements of metal authorizations could be reviewed. In no 
other way would it have been possible to determine whether the WPB 
administrators were authorizing materials within the limitations estab¬ 
lished by the Requirements Committee “pie cuts.” As a result, compre¬ 
hensive materials accounting was undertaken for the fibrst time under 
the PRP. 

The concept of materials accounting control growing out of this ex¬ 
perience under the PRP constituted one of the major innovations in 
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the reporting system. So inherently necessary, sound, and successful 
were the general methods developed that control accounting grew to 
constitute an important part of many future control mechanisms within 
the WPB, in other wartime agencies, and in industry. 

Vertical Versus Horizontal Allocation Reporting, The core of the 
production and distribution problem lay in the tying of materials 
needed by suppliers of parts and subassemblies to end products. In 
the fall of 1942 these problems assumed new importance, as fabricating 
facility shortages increased. Consideration of this problem brought a 
shift in thinking about basic over-all allocation control from compre¬ 
hensive versus individual allocation systems (the PRP had settled 
that conflict) to comprehensive horizontal versus vertical allocation 
techniques. The concept grew more firmly that the horizontal material 
allotment approach of the PRP had to be discontinued in favor of a 
vertical material allotment approach. Thus, rather than allot materials 
directly to manufacturers at each fabrication stage, it was reasoned 
that metal flows could be tied tightly to end products if end-product 
producers allotted metals to their suppliers, the suppliers to their sub¬ 
suppliers, and so on down the contractual chain to basic material 
producers. 

Probably the most important shortcoming of the PRP as an alloca¬ 
tion device (and the one from which most of its other deficiencies 
stemmed) was that it could not force a balancing of military and essen¬ 
tial programs in relationship to limited metal supplies. The WPB 
through PD-25A was in a position to match total metal requirements, 
so far as they were known in industry, against total metal supply. 
Previously the WPB had relinquished its control over the contractual 
rights of procurement agencies and therefore was not in a position: 

(1) to force a balancmg of military end programs within themselves; 

(2) to force a balancing of end programs within total material supplies; 
and (3) to force a tight distribution of metals and components in ac¬ 
cordance with balanced programs. The PRP was forced to carry the 
burden for controlling the flow of metals and components in the ab¬ 
sence of programming and scheduling, a burden it was not designed to 
carry. 

In the absence of balanced military and essential civilian programs, 
the PRP and PD-25A could not effect the desired flow of materials 
through the industrial system nor could they govern the scheduling of 
component flows in the volume and time sequence required. When the 
total supply of tight materials was calculated by the WPB, require¬ 
ments as submitted by industry on PD-26A could be matched against 
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it. This system produced facts on metal requirements of end programs 
for which contracts had been let at each level of manufacturing. These 
requirements could be matched against supply and allocation made 
accordingly. No mechanism existed, however, for forcing procurement 
agencies to pare contracts in conformance with this balancing of de¬ 
mand and supply. As a result, metal allocations made to a com¬ 
ponent producer, for example, often were not in conformance with the 
changing needs of end programs. In the absence of control over end pro¬ 
grams of military agencies, serious difficulties arose with the PRP 
method of horizontal plant allocations whether such allocations were 
intelligently made or not. 

These deficiencies of the PRP could be resolved, it was felt, by metal 
distribution on a vertical basis. With such a system the WPB would in 
effect create a bank balance for quantities of metal for each procure¬ 
ment agency. Those agencies in turn would write “checks” on their 
balances and distribute them to prime consumers. These proceeds 
would constitute a bank balance on the books of prime consumers from 
which they could make authorizations to their suppliers. Suppliers in 
turn would make extensions to sub-suppliers and so on. The net 
result would be, it was argued, that “bank withdrawals” could be kept 
within available balances and the distribution of materials would be 
tightly tied to and coordinated with production schedules. 

After continuous discussion of alternative allocation proposals from 
early 1942 through the fall of that year, this approach to the problem 
of metal allocation was chosen. Among the more dominant schemes 
discussed were: a modified PRP, a “Warrant Plan,” the “General 
Motors Plan,” the “Material Allotment Plan,” and the “Steel Quota 
Plan.” Out of the exploratory discussions pertaining to these proposals 
and the critical materials situation there evolved the Controlled Mate¬ 
rials Plan, the principal procedure of which was a vertical method of 
allocation. This plan was to constitute the only comprehensive metal 
allocation system in operation throughout the remainder of the war. 

The Controlled Materials Plan. The Controlled Materials Plan was 
announced in November 1942, went into effect experimentally in April 
1943 and fully in June 1943. The plan was based upon the theory that 
control over the pi iduction and distribution of a few basic materials 
—steel, copper, and aluminum—^which were known as the “controlled 
materials,” was sufficient to regulate a large part of industrial produc¬ 
tion. (For operational purposes these materials were broken down into 
16 diapes and forms.) Control was exercised on the basis of two prin¬ 
ciples: (1) authorization of only as much production as available sup- 
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plies of basic metals would permit; and (2) establishing checks upon the 
use of these materials to maintain authorizations within limited sup¬ 
plies. To execute this control, four basic steps were followed: (1) deter¬ 
mination of demands for materials; (2) determination of supplies of 
materials; (3) matching demand and supply; and (4) feeding material 
to scheduled production on a vertical-allotment distribution basis. At 
the apex of the pyramid up which demand-supply information flowed, 
and down which allotments were channeled, was the WPB Require¬ 
ments Committee. 

To the Requirements Committee, ultimate purchasers of all manu¬ 
factured products containing controlled materials presented statements 
of their demands for those materials in terms of programs, physical 
metal quantities, and time periods. These purchasers, called Claimant 
Agencies, such as the Army, the Navy, the Maritime Commission, the 
Oflice of Lend-Lease Administration, and the Board of Economic 
Warfare, established so-called end programs designating equipment 
ultimately purchased for their account. Determination of material 
needed to fulfill these programs was facilitated by the acquisition from 
their prime contractors of “prototype” bills of materials (generally sub¬ 
mitted on form CMP-1). Prime contractors in turn requested similar 
information from their suppliers, and so on down the contractual 
chain to the first fabricator of mill shapes. Thus, extending up through 
the contractual chain to the WPB Requirements Committee, detailed 
material requirements flowed. Controlled material supplies in various 
time periods were calculated by the WPB Controlled Materials Divi¬ 
sions. On the basis of this information, the Requirements Committee, 
in consultation with the materials Divisions and the Claimant Agencies 
determined how demand for various agencies and programs could be 
reduced to match supply. When this was accomplished the Require¬ 
ments Committee made lump allotments against the supply to Claim¬ 
ant Agencies and their principal programs. 

Upon receipt of allotments from the WPB, the Claimant Agencies 
distributed quantities among their own detailed programs. Authoriza¬ 
tions for all detailed programs had to remain within total quantities 
received. Each agency, therefore, instead of competing with others 
through the issuance of preference ratings, worked toward a bank 
balance. If it wasted materials in one direction, it had less in another. 
Claimant Agencies then divided their allotments among their prime 
contractors, giving each enough controlled materials to carry on his 
part of the total program. Prime contractors, within the limits of their 
own “bank balances,” passed along the portion of their allotments 
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needed by sub-contractors and sub-subcontractors to produce parts and 
subassemblies for their contracts. Thus, material distribution was 
passed down through the contractual chain. This simple procedure of 
following contractual channels tied metal demand to production 
schedules even though it did not reduce the dollar value of outstanding 
contracts. 

The entire plan did not operate on vertical-allotment distribution. 
Components and shelf-items such as nuts, bolts, ball bearings, and elec¬ 
tric motors were purchased by so many manufacturers for so many end 
uses that producers could not conveniently obtain controlled material 
allotments on the basis of vertical-allotment distribution. For these and 
other manufacturers of products such as pumps, gasoline engines and 
such wide-use components, who would have had trouble in receiving 
allotments by vertical distribution, the PRP method of horizontal allo¬ 
cation was adopted into the CMP. By this means, as under the PRP, 
these producers asked for and received allotments directly from the 
WPB. 

To facilitate operation of the CMP, all finished products of manu¬ 
facturers were classified into two groups. Class A and Class B. Class B 
products originally were intended to include only those subassemblies 
or miscellaneous or specialized items of equipment for which individual 
allotments could not practically be made from prime consumers down 
the contracting chain. This concept was modified, however, and many 
final products ultimately were defined as Class B products. All other 
products were Class A products. 

The distinction between these two classifications was vital in CMP 
operations. All producers of Class A products were included in the 
vertical chain of allotment distribution, submitting their requirements 
(form CMP-4A) to their customers and receiving from their customers, 
in turn, metal allotments. Class B producers, on the other hand, were 
included in the horizontal-allotment distribution system, submitting 
directly to the WPB (form CMP-4B) their requirements for metals 
and, in turn, receiving from the WPB their needed allotments. As dis¬ 
tinguished from the plant approach of the PRP, all CMP application 
and authorization documents were in terms of individual Class A or 
Class B products. 

Insurance that demands were kept within metal supplies, once the 
WPB Requirements Committee had distributed allotments, was ac¬ 
complished by an effective materials accounting system. Establishment 
of adequate accounting controls throughout the allotment system was 
based on the idea, born of experience, that effective control necessitated 
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procedures by which any overdrawing of accounts at any point in the 
distribution process could be prevented or detected. Application of this 
idea resulted in accounts of record showing for each program, for each 
material shape and form, for each quarterly period: authorizations, is¬ 
suances, and balances. Such records were maintained in the WPB 
Industiy Divisions, Claimant Agencies, and every industrial plant re¬ 
ceiving allotments. 

An effective check on the impact of metal authorizations at the basic- 
controlled-material-producer-level was accomplished by a system ot 
periodic reports made by producers to the WPB Controlled Materials 
Divisions. On form CMP-8, producers reported shipments and unfilled 
orders. These reports were tabulated by the WPB and provided a direct 
measure of procurement pursuant to allotment by Claimant Agency 
and Industry Divisions, in terms of end programs for which original 
allotments were made by the WPB Requirements Committee. 

The flow of end-use program information from Claimant Agencies 
through industry to metal mills was made on the basis of the passage 
of allotment program symbols. Originally, identification of allotments 
called for the passage of thirteen digits but the unworkability of the 
transmission of so many numbers resulted in a modification adopted 
before the Plan became operative, which provided only for identifica¬ 
tion of the Claimant Agency and the first digit of its program number. 

Benefiting from the experiences of the PRP, various control S 3 ^tems, 
in effect in the WPB at the time the CMP was introduced were modified 
to conform with the latter. First, and of paramount importance, acqui¬ 
sition of controlled materials was only on the basis of direct allotment; 
the preference-rating system did not apply. Second, substantial modi¬ 
fication or revocation was made among M, E, and L orders to avoid 
any interference or conflict with the operation of the CMP. As a result, 
some questionnaires established to operate individual allocation sys¬ 
tems and other programs were abolished or substantially modified. The 
reporting and application forms of the CMP were given a dominant 
position which never had been extended to the PRP and its instru¬ 
ments. 

At this stage of the war effort, inventories had ceased to be a serious 
problem. Unbalance between the demand for and supply of critical 
materials had flushed out most idle usable inventories. The CMP pro¬ 
vided no automatic cleansing of idle inventories but rather exerted 
control by establishing a 60-day limitation on inventory accumulation. 
To assure compliance a plant report (form CMP-7) was issued covering 
operations in the third quarter of 1943. The form requested information 
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concemiag plant shipments by products and for each CMP metal shape 
and form: actual use, receipts, and inventory. After one quarter of 
operation, the basic features of the form were incorporated into form 
WPB-732. 

The CMP proved to be eminently successful in equating the demand- 
supply formula for controlled materials. Indeed the CMP, perhaps 
more than anything else, was a fruition of the concept that no WPB 
control policy over metals was adequate which did not devolve upon 
procurement agencies the responsibility for balancing their end pro¬ 
grams and establishii^ methods by which these end programs could 
be translated into elemental common denominators of materials and 
processing time schedules. The CMP did force such a balancing and 
capitalized upon it by establishing suitable procedures in the WPB 
for acquiring accurate supply data and for equating demand and 
supply along realistic patterns. The CMP unquestionably firmly set¬ 
tled the problem of equating the demands for and the supplies of 
metals. 

The plan was less successful in scheduling production. In operation, 
the CMP was primarily a materials-control system. It did not provide 
the machinery for balancing component production schedules in con¬ 
formance with end programs. Claimant Agencies had the authority to 
spread out material allotments in conformance with their end-program 
requirements, but the flow of materials was not controlled in terms of 
time and volume through the industrial chain. As a result, component 
production for end programs was often far out of line with real require¬ 
ments. Efforts to solve this problem were embodied in various regula¬ 
tions designed to supplement the operation of the CMP. 

Scheduling Production. There emerged in full force in 1942 a group of 
problems relating to production scheduling. Although their emergence 
was detected early in the year, measures for their solution were not to 
be adopted until 1942 had almost ended. An analysis of four months 
of ordnance production, for example, indicated that this program was 
getting out of balance. The output of end items and the output of the 
complementary items essential to the use of the end items were not 
proceeding at the same rate. The scheduled production of components 
for artillery and aircraft ammunition was out of joint with the require- 
mmits for these components to meet production objectives for complete 
items. Many component schedules ran to two or three times require¬ 
ments, and the schedules for two-thirds of the components called for 
production more than 15 per cent in excess of requirements. The result 
was that the ammunition program was using excessive quantities of 
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scarce materials. This and comparable situations demanded administra¬ 
tive processes by which the flow of critical materials would be controlled 
in accordance with scheduled uses, which ^ould be limited to require¬ 
ments for production objectives. 

As the year 1942 moved on it became increasingly evident that, al- 
thou^ total production was advancing, its rate of progress was level¬ 
ing off. Solution to the problem lay not so much in basic material 
supplies, which had increased considerably during the year and had 
been matched with demand under the PRP, as in a system which de¬ 
fined points where materials were needed to effect a balance among end 
programs and which would direct critical materials to them. 

The military services, which had imdertaken the scheduling of pro¬ 
duction for military items, maintamed little balance in schedules. 
Some schedules were over-optimistic, and others were conservative; 
there was little agreement as to lead factors, conversion periods, and 
provisions for the effect of changes in design and specifications. More¬ 
over—and this was of crucial importance—^there was no central review 
of the schedules for the purpose of reconciling competitive demands for 
common resources. And yet without such a review of schedules to align 
competing demands in relation to the maximum supply of common re¬ 
sources it was impossible to insure the development of realistic sched¬ 
ules. With the development of each new bottleneck in components, the 
record showed more clearly that inept scheduling had to be rectified 
if the nation was to squeeze the maximum production out of its re¬ 
sources. 

In December 1942 a Production Executive Committee was estab¬ 
lished in the WPB with broad and inclusive powers to schedule com¬ 
ponents of all Claimant Agencies; to see that program schedules did not 
conflict, were in balance, and were consistent with the maximum pro¬ 
ductive possibilities of the national economy; and to ensure that sched¬ 
ules wore in accord with the strategic requirements of the Joint Chiefs 
of Staff. A series of preliminary steps were taken to solve scheduling 
and related problems and in February 1943 General Scheduling Order 
M-293 was annoxmced, to become operative on April 1,1943. 

The original M-293 applied to 35 classes of components classified into 
three categories of urgency. The WPB Industry Divisions were given 
fuU authority to control the shipment of these items. Information to 
exercise the necessary control was reported by manufacturers. Form 
PD-900 provided data on production capacity, orders, and order 
schedules for various periods. Forms PD-901 and PD-902 requested 
detailed delivery schedules. After the issuance of M-293, additional 
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WPB Industry Division scheduling orders appeared like dandelions in 
the spring and brought many new questionnaires with them. Since all 
these forms required details concerning scheduling of individual orders 
for individual items and customers, they more frequently than not were 
bulky and exceedingly burdensome to fill. Most of these orders were 
popular in industry, however, because by reason of them a manufac¬ 
turer could freeze shipment schedules and go about his business without 
the need for constant adjustment and readjustment in schedules other¬ 
wise necessitated by the priorities system. 

M-293 probably stimulated the flow of a larger volume of reports 
to the PWB than any other single regulation. After the novelty of com¬ 
ponent scheduling had worn off, most of the scheduling forms received 
by the WPB were filed away without review or any action whatsoever. 

Reporting Trends in Other Areas. The preceding movements, viewed 
from an historical perspective, appear to be those of outstanding im¬ 
portance in the evolution of the reporting structure. Apart from them, 
however, there was constant evolutionary development, of a less spec¬ 
tacular character and with less fundamental impact upon industrial 
control, in hundreds of individual products and materials. In each in¬ 
stance in which the demand-supply equation for a particular product 
was so unbalanced as to impede production, special reporting require¬ 
ments were instituted to provide facts for intelligent administration. 
When the problem ceased to exist, reporting was generally discon¬ 
tinued. The often unpredictable impact of war upon the thousands 
of products needed for its prosecution created a substantial ebb and 
flow of control problems which in turn was reflected in the rise an 
discontinuance of thousands of individual questionnaires. 

The use of control procedures and reporting methods adopted to 
support them often paralleled and leaned heavily upon the experi¬ 
ences described above. Thus, for example, the methods of the Lumber 
Control Plan and the Tire Allotment Plan were essentially those de¬ 
veloped along PRP lines. 

By the middle of 1944 the basic WPB reporting structure was cre¬ 
ated. Thereafter, its movements were essentially caused by modifica¬ 
tions of existing reports rather than by the introduction of revolu¬ 
tionary new methods. With the approach of VE-Day, and later VJ- 
Day, strenuous efforts were made to simplify the reporting structure 
by eliminating from it all questionnaires of little or doubtful use. The 
result was that in the period immediately preceding VJ-Day the WPB 
operated with fewer questionnaires than in any previous period after 
the entrance of the United States into active warfare. 



ON POOLING DATA 

Fbebebice Mostelleb 
Harvard University 

On the basis of samples from normally distributed variates 
X\ and Xi, how should one estimate the mean of Xi? Should 
one pool or not pool the means on the basis of a significance 
test, or should one pool continuously, or not at all? It turns 
out that if the difference between the true means can be 
thought of as normally distributed from sample to sample, 
pooling with unequal weights is preferable. 

INTRODUCTION 

T he purpose op this article is to discuss a very simple problem con¬ 
cerning the pooling of data. If we have a sample of n measurements 
from population A and a sample of n measurements from population 
B, where A and B are similar, how shall we estimate the mean of popu¬ 
lation A? Shall we merely take the mean of the sample from A, or shall 
we attempt to combine the means of the two samples in the hope of 
improving our estimate of A in some average sense? 

A variation of this problem is frequently met in the analysis of vari¬ 
ance where various estimates of error are sometimes pooled after an 
initial significance test has been made. This case has been treated in 
some detail by T. A. Bancroft. The problem of pooling means does not 
seem to have been treated. The principal reason for discussing the pool¬ 
ing of means is that the notions are much more easily understood than 
the corresponding question of pooling sums of squares. Considerable 
insight into the general pooling problem can be gained by an examina¬ 
tion of the more elementary problem. The problem of the pooling of 
means can be discussed in an elementary class in statistics. In addition, 
there are occasions when the question of the implications and the results 
of pooling means arises in practice. 

If we have two (or more) similar measures of an individuaFs perform¬ 
ance, is it preferable to take the later measure as an index of his ability, 
or would it be preferable to average the measures? If a producer sells 
us two very similar products made of the same materials on the same 
machines by the same operators, shall we estimate his performance on 
one of the products by the per cent defective observed on that product, 
or can we get a better picture by combining the results of samplings on 
both products? This question may be of considerable interest if the 
testing happens to be destructive and expensive. These and similar 
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questions are all concerned with the pooling of means. We shall, of 
course, ideahze the problem considerably. 

Let us suppose we have two kinds of “12 inch” rulers, which for con¬ 
venience we designate as “red” and “black.” We assume that both kinds 
of rulers have lengths which are approximately normally and inde¬ 
pendently distributed with equal known variances cr®, but unknown 
means, Ur and Ub, where the subscripts indicate the color of the ruler. 
We take random samples of n red rulers and n black rulers. Let Xr, xj, 
be the sample means, then for k>0 if 



we accept the null hypothesis that Ur—Ubj at significance level 



If the inequality in (1) does not hold we reject the null hypothesis in 
favor of the alternative Ur 9 ^ui,, 

Once the significance test has been made, the following question 
sometimes arises. On the basis of the samples, how should one estimate 
the mean length of the red rulers? 

When the null hypothesis is rejected, many feel that the “best” esti¬ 
mate of Ur is Xr. We know that the variance of Xr is 

( 2 ) 

n 

if we always use just the sample of n red rulers to estimate Ur. But 
when the null hypothesis is accepted some people say: “Now that we 
have accepted the null hypothesis, Ur—Ub, the ‘best’ estimate of Ur is 
the grand average of the lengths of the 2n rulers {xr-{-x^/2J” People 
with a little different philosophy argue: “No matter how many signifi¬ 
cance tests you make you can never prove the null h 3 pothesis. You 
should forget all about the significance test. The ‘best’ estimate of Ur is 
the mean of the n red rulers.” Still others say: “This is a problem of 
estimation, not one of tests of significance. One should merely inquire 
what weights to assign to Xr and No doubt still other people hold 
still different views. 

It may well be that the criterion selected as a significance test and 
the criterion selected for deciding to pool or not to pool are two different 
things. For the purpose at hand we shall work with a model for pooling 
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which uses the same test as that given in equation (1). However one 
may still widi to use a value of k for a significance test which is different 
from that for the pooling criterion. That is, if one were using a .01 level 
for a significance test one might require a .20 level, say, before pooling 
with equal wei^ts. 

It ^ould not be thought that the above discussion implies any am¬ 
biguity about the meaning of a significance test. Bather, the ambiguity, 
if it exists, has to do with the use of the usual concept of test of sig¬ 
nificance as a basis for estimation and action. Of course, the mathemati¬ 
cal principles laid down for the use of a significance test and the actual 
procedure followed often differ. For example, in theory one sets a 
significance level in advance, one makes a test of significance, and one 
accepts or rejects the null h 3 rpothesis at the significance level previ¬ 
ously chosen. 

Actual practice usually differs from the procedure outlined above. 
The person working with the data has no fixed criterion in mind, but 
rather a vague feeling that the .05 level of significance is pretty widely 
used, that the .01 level is somewhat more stringent, and that the .001 
level is something an experimenter’s results do not exceed too often. A 
standard calculation procedure is now performed. Now the experi¬ 
menter tends to report the percentage of times his result would have 
been exceeded had the null h 3 rpothesis been true. The words “hi^y 
significant” are usually tossed about in connection with results beyond 
the .001 level, and special remarks are included about results which 
just do not or barely do exceed the .05 level. 

One might argue correctly that those who practice the procedures 
followed above are not performing significance tests at all, but that 
they are using the standard procedure to help describe the data and 
the results of the analysis. Nevertheless, a very large proportion of 
those who analyze data follow the practice of not setting a significance 
level in advance (or at most a fuzzy one), and at the same time refer 
to their procedure as a significance test. 

The purpose of the present discussion of theory and practice in con¬ 
nection with significance tests is not meant as a criticism of anyone’s 
procedure. It is presented, because it helps indicate how arguments 
can arise about the interpretation of significance tests. 

PBBCaSION OP ESTIMATION 

Krst we need a preliminary agreement upon an objective way of 
measuring which of two estimates is the better. If it is insisted that the 
estimate of Ur must always be imbiased, proponents of pooling the two 
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samples have already lost their case (so poolers should not be among 
those who insist on unbiased estimates). However a more temperate 
view might be that the mean-square deviation D* from the true param¬ 
eter value is a reasonable measure of the goodness of the estimate. 
We diall use this criterion, although others are equally valid. This 
criterion has some computational advantages for the models we will 
construct. 

If we always pool the two samples with equal weights, the mean- 
square deviation of the pooled estimate of itr, £=(ier+^)/2, is 

ff* d* 

(3) + 

2n 4 

where d—tir—Ui, the difference between the means of the red and black 
rulers. 

On the basis of the mean-square deviation criterion, those who rec¬ 
ommend constantly pooling with equal weights have the advantage 
when 

n 

T 

while those who recommend using only the measurements from the n 
red rulers have the advantage when 

n 

— d* > <7*. 

2 

If d* is as great as 2<7*, there is never any advantage in pooling. Further¬ 
more, for any fixed <r and d, d not zero, we can always take samples 
sufficiently lai%e that there will be no advantage in pooling. 

ESTIMATION WITH FIXED DIFFERENCE 

We have not done complete justice to those recommending pooling. 
They point out that they only pool when their criterion suggests that 
they should. Consequently, we must evaluate the expected value of 

[i(*r + «&) — Ur]® 

over the acceptance region, plus the expected value of (Sr—Ur)® over 
the rejection region. When this is done we can divide the mean-square 
deviation by o®/», the variance when one never pools, and thus get a 
disadvantage coefficient. Essentially we are taking the variance of the 
estimate £, as the standard. 
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The mtegrations are easy to perform, and the disadvantage coefiS- 
cient turns out to be 


1 / fk-X 

C = —< 1 + iM N{y)dy + (k - x)Nik - x) 

2 I •/-i-i 

+ (A: + x)N(k + x) + C N{y)dy + f N{y)dy\ . 

J J _* ; 


In equation (4), x—dl<r\/nf2, the distance between the true means 
measured in terms of the standard deviation of the difference of the 
sample means; k is the critical ratio indicated in equation (1); and 
NiU) is the normal distribution 


c 


3 


FIGURE 1 


JL_ 

iMseo 





o 

THE DISADVANTAGE COEFFICIENT C AS A FUNCTION OP x^d^n/ir^M FOR 
**0, 1. 2. 3, ««>. WHEN d IS FIXED IF \t\ <i, 2»(*r+x6)/2 IS USED TO ESTIMATE ti,, 
WHILE IF I <1 THE ESTIMATE £r IS USED 
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The bdiavior of C is more easily understood if one examines the 
graph shovm in Figure 1. We only show positive differences, because 
the figure is symmetrical about x=0. Now providing the difference is 
very small we can get advantages from pooling up to 10 per cent when 
*=1, up to about 35 per cent when &=2, up to 47 per cent when k=3, 
and up to 50 per cent when fc = «. When k—Owe never pool—^that is, 
we never expect to find «r=3fe, exactly. When «, we always pool, 
that is we set our criterion so large that we invariably pool. 

The disadvantage of pooling occurs when x gets large. If we always 
pool, the mean-square deviation can be so large that the disadvantage 
coefficient tends to infinity. If we have any criterion which stops us 
from pooling, it will happen that when the difference between means 
gets too large we will not pool. Consequently for any finite value of k, 
as X increases, the disadvantage coefficient will rise to a maximum, 
and then return asymptotically to C=1. It will be noted that when 
A;=3 it can happen that the mean-square deviation is about three times 
the variance we get when we never pool. Similarly for k=2, C can grow 
as lai^e as 1.75, while for 1;=1 the maximum for C is about 1.12. 

Diagrams such as Figure 1 suggest that if one has any additional 
information about d, then something might be gained by pooling or 
not depending on the indication of the figure. For example, if one were 
confident that [d] ^1, o-=30, n=50, then x^.l67. In this case one 
should always pool. 

There are examples where the distribution of the true mean can 
reasonably be thought to be approximately normally distributed. If we 
examine the day to day performance of an individual on certain psycho¬ 
physical tests after the initin.! training period we find that the distribu¬ 
tion of his test-to-test means can be approximated by a normal distri¬ 
bution even though his test-to-test variation cannot be explained by 
random sampling errors from a single population. Also, if we examine 
the distribution of mean ediuscUional age of 6th grade classes in a certain 
area, we will find these means to be approximately normally distributed 
but with variance too large to be explained by random sampling from 
the same population. 

It is not the intention of the author to ai^e that there are large 
numbers of examplea of normally distributed means, but rather that 
there are many examples of cases where the parameter can be thou^t 
of as having a distribution of some sort. In sudi cases there is a possi¬ 
bility of improving an estimate of if it is known that two samples 
(or more) are drawn one from a population with mean Ur, another from 
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a population with mean Uh, where both Ur and uj, are samples from a 
parent population of true means. 

ESTIMATION WHEN DIFFERENCE IS DISTRIBUTED 

We have seen how the disadvantage coefl&cient C behaves when the 
difference d between the means is fixed. In a series of experiments, or a 
lifetime of estimation, we might prefer to think of d as varying. If we 
knew something about the frequency with which d takes on different 
values we might be able to take this into account. Of course this is the 
fundamental Bayes Theorem difliculty. However if we are wiUing to 
consider the problem from the point of view of a long series of similar 
experiments, we can construct some instructive models. 

For example, we might be willing to assume that d is normally dis¬ 
tributed with zero mean and variance for a series of experiments 
of a certain type for which we would take samples of the same size n. 
It is not unreasonable to assume that d is symmetrically distributed 
about zero, because the choice of the order Ur—uj, or Ub—Ur is arbitrary. 
The assumption of normality is a more difficult case to argue. It will 
not be attempted. Instead it is suggested that it would be useful to 
have a number of additional models of various kinds worked out for 
problems of this nature. 

STANDARD ERROR OF THE ESTIMATE 

It needs to be emphasized that in some cases one is anxious to have 
an accurate estimate and is willmg to sacrifice knowledge of its ac¬ 
curacy, while in other cases it is very important to have a "reasonably” 
unbiased estimate of the standard error of the estimate. If one never 
pools, one can use o^/n as the standard error as usual. In cases where 
we always pool the standard error of the estimate will be o^/2w, even 
though the estimate may be biased considerably. When we use a "some¬ 
times pool” procedure the situation is much more complicated. It 
would, of course, be inaccurate to assign the standard error the value 
a^/rij when we do not pool, and <r®/2n when we do. Instead we must 
think of the distribution of the estimate of the mean and get its stand¬ 
ard error for repeated sampling. The standard error will depend on the 
significance level and on the true difference between the means. We may 
underestimate the standard error if we use either of the above formulas. 

If it is assumed that d is normally distributed, we may expect the 
poolers to continue to pool when | <k, and not to pool when 1 1\ S*, 

making their estimates in the same way as described above. 
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Again we will compute the disadvantage coefficient C. This time 
the integtations are more tedious, but we come out with 


C = 1 


J 0 


( 5 ) 


2 


+ 2 


N(v)dv 


)■ 


where y=naK It will be remembered that n is the sample size, oV is 
the variance of d, and h is the criterion for pooling or not pooling. As 
before NiU) is the normal distribution with zero mean and unit vari¬ 
ance. 

An examination of equation (5) shows that when k—0, 0=1, which 
is what we expect if we never pool. Further, when &= <» we always 
pool, and the limit C=(2+Jia*)/4, which can easily be checked to be 
the expected value of 2>®(*)/o*(*r) when d has the distribution we have 


It can be shown that when y is large 0 tends to unity asymptotically 
from above like l+k*\/2/3-s/2iry(H-5/y). This means once again that 
if a pooler almost always gets large discrepancies in the t-test, he will 
only pool occasionally, and the effect will be essentially as if he did not 
pool at all. 

Figure 2 shows more clearly how 0 behaves as y/y=aVn goes from 
0 to 6. We see that all the curves for 0<i:< « are contained between 
the curve C=(2+y)/4 for « and the strai^t line <7=1 for fc=0. 
It can be verified from equation (5) that when Vjr =V^> 0=1. 

With the present model we note that for small values of no ®=y there 
are advantages up to 50 per cent to be gained by pooling. For na^=y>2 
some loss in precision wiU occur. However, with the present model the 
losses do not go as high as when we dealt with a fixed value of d (Fig. 1), 
providing fc < oo. 

For convenience, we have graphed 0 against Vy> for 
For k=Z a very flat maximum occurs at about Vy=4.6, and <7= 1.7; 
for &=2 the maximum occurs at about •\/y=4 and <7=1.2; while for 
fc=l, the maximum occurs at about •>/y=Z, and C=1.04. 

We compare the maximum C's for the two models discussed in 
Table I. 

Of course, even if one accepts the present model, one still has to 
make a decisicm between pooling and not pooling. We do not usually 
enter into an important experiment with no ideas at aU about the order 
of magnitude of the quantities we wUl^find. 
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TABLE I 


MAXIMTTM DI 3ADTA NTAQB COEFFICIENT C IN POOLING FOB TWO CASES: 
1) FIXED DIFFEBENCB BETWEEN MEANS d; 2) d IS NORMALLY DIS¬ 
TRIBUTED. THE CBITERION FOB POOLING OB NOT POOLING IS I 


k 

d fixed 

d normal 

0 

1.00 

1.00 

1 

1.12 

1.04 

2 

1.75 

1 2 

3 

3.0 

1 7 


For example, suppose we believe that the standard deviation of the 
difference between the mean lei^hs of successive pairs of shipments of 
red and black rulers is about .001 inch, that the standard deviation 
of red and black rulers is about .010 inch. If we take a sample of 50 


Z 

c 

Ks3 


FIGURE 2 


L 


/a:= 





THE DISADVANTAGE COEFFICIENT C AS A FUNCTION OP V5*aVn FOR h^O, 
1. 2. 3. WHEN d IS NORMAMiY DISTRIBUTED WITH ZERO MEAN AND UNIT 
VARIANCE. IF 11| <A. i-(.Sr+ii,y/S IS USED TO ESTIMATE Vn WHILB IF [j| i*. THE ESTI¬ 
MATE t, IS UBBD. THE LOWEST CURVE, Hr, IS BASED ON THE MAXIMUM LDSEUHOOD 
ESTIMATB dr->(:^4«)-(^)/C2+»}. 
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rulers of each color, then o=.001/.010=0.1, and o-s/n=-707. So no 
matter whether we are usii^ ft = 1, 2, or 3 as our criterion, Figure 2 
indicates that the advantage is “probably” with the poolers. The word 
“probably” is used because the quantity .001 inch given above is 
really a guess on the part of the experimenter. There is nothing repre¬ 
hensible about intelligent guessing. The most successful experimenters 
are those who combine a good knowledge of their subject matter and of 
experimental design with ability to make judicious guesses. The elS.- 
ciency of experiments can almost always be improved by good guessing. 

On the basis of this model, therefore, if there is some possibility of 
Tinfl,THng intelligent guesses about a and <r, the author would decide be¬ 
tween pooling and not pooling on the basis of C and the sample. In 
the absence of such guesses the author feels that between those who 
never pool, and those who require a two-sigma agreement (| <| <2) or 
better before pooling, there is little to choose. It does not seem likely 
that the latter can be injured badly on the average, and they may gain 
considerably. 

On the baris of the present mvestigation the author’s principal reason 
for disliking the use of a ft as liberal as 3 comes from the disadvantage 
factors when d is regarded as fixed. It is always possible for an experi¬ 
menter to encoxmter a series of experiments where d does not behave 
as it is assumed to do in the variable model, and in that case one may 
very well be multiplying the no-pooling precision by a factor of 3. 

Of course, if the observations are cheap and easy to get, the problem 
scarcely arises; one merely takes a larger sample of red rulers. 

UAX3UUM LIKBLIHOOD TOOLINQ 

We have gone to a good deal of trouble to inquire about what hap¬ 
pens when the decision to pool or not to pool is either made a priori, or 
made on the basis of a criterion which is equivalent to a significance 
test. We have kept to this line because we were interested in this ques¬ 
tion, and because this is a method often used in pooling. 

An important model which might be mentioned however is the maxi¬ 
mum likelihood method. If we consider the case of d fixed, the maxi¬ 
mum likelihood estimate is ir, and (7=1, as we have noted. But if we 
assume as in the previous modd that d is nonnidly distributed, the 
maximum likelihood estimate suggests that we alwajrs pool, but assign 
varying weights to Sr and St. The TwaTrimiiTn likelihood estimate of Ur 
is just 

( 6 ) 


, ^(1 + 4 - Sh 

^ -- 

2 -h no* 
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If na®=0, that is d=ii,—Ui=0, the pooled estimate (*,+*»)/2=* is 
the maximum likelihood estimate. If no? is extremely large compared 
to itr, £b, and 2, then the maximum likelihood estimate reduces to £r> 
The mean-square deviation of from tt,, using the model with d 
normally distributed gives 


(7) 


DKilr) = 


<r®(l -1- no®) 
n(2 -f- no®) 


Here we see that as no® goes from zero to infinity the pooled estimate 
has mean-square deviation running from <r®/2n to ff®/n, that is from the 
variance for 2n observations to the variance for n observations. The 
lowest curve (labelled <ir) of Figure 2 is a graph of DH-Cir)- 

Equation (7) is also the asymptotic variance of ^ given by maTrinruim 
likelihood methods when the true value of a® is unknown, but to be 
estimated from St and Sb. It is not clear however exactly what estimate 
of a® should be used for small samples. However, the estimate is ap¬ 
proximately (jSr—jei)*/<T®. If this value is used for o* in equation (6) it is 
clear that when the difference between the sample means is large Xr gets 
most of the weight, while if it is small the weights are more equally 
balanced between the sample means. 

The curve for Hr in Figure 2 shows that the disadvantage factor for 
the maximum likelihood method is always less than imity. This implies 
that if one has reason to believe that d is normally distributed the 
mft TriTniiTn likelihood solution is by far the most profitable among the 
ones discussed. The implication is that at least for this situation pooling 
with unequal weights is usually the best policy. The all-oivnone cri¬ 
terion method cannot compare with the estimate of equation (6). 

This result is rather important because it backs up our intuitive feel¬ 
ing that we should use aU the information at hand in makii^ our esti¬ 
mates. 


CONCLUSION 

We have presented and discussed several ways of pooling data from 
two samples to estimate the mean of the population of one of them. 
Perhaps the most important point of the discussion is that for many 
kinds of estimation, not only of means, we need more models to study. 
These models help us understand what is going on when we pool data 
from several samples to make a single estimate. Furthermore they may 
assist us in deciding which of several posable methods of combining 
data may be preferable. 
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The author is indebted to J. Wolfowitz for calling his attention to an 
error in an early draft of this paper. 
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USE OF VARIANCE COMPONENTS IN THE ANALYSES 
OF HOG PRICES IN TWO MARKETS 

R. L. Anoerson 

Institute of Statistics, North Carolina State College 

Corrections to article published in Volume 4^, No, 240 
December 1947 

Page 618, Table 4: The variance components for the days mean square 
should include 24(4f. 

Page 624, line 2 should read: = Vdfny+Vmyc—Vdmyo 
Page 624, line 4 above the table should read: 

Ml V^ “1“ day Vdmyc' 

Page 625 in the Table for F', last line, should read; 3.34*. 

Page 360, line 6 from the bottom should read: ay= ± 1. 



ON ESTIMATING PRECISION OF MEASURING 
INSTRUMENTS AND PRODUCT VARIABILITY 

Fbank E. Gbubbs 

Ballistic Research LahoratorieSf Aberdeen Proving Ground, Md. 

A measurement or observed value is discussed as the sum 
of two components—one the absolute value of the character¬ 
istic measured and the other an error of measurement. The 
variation in absolute values of the characteristic or items meas¬ 
ured is termed product variability, whereas the variation in 
errors of measurement of an instrument is called the pre¬ 
cision or reproducibility of measurement. Techniques are 
given for separating and estimating product variability and 
precision of measurement. Comparisons of the various tech¬ 
niques are also discussed for cases involving two or more 
instruments. 

1. INTRODUCTION 

A MEASUREMENT consists of the sum of two components, one com¬ 
ponent being the true or absolute value of the item or charac¬ 
teristic measured and the other an error of measurement. In making 
measurements, the aim is to obtain as nearly as practicable the true 
or absolute value of every characteristic measured. This is accom¬ 
plished either by attempting substantially to eliminate errors of 
measurement or by using an instrument for which the errors of meas¬ 
urement are known to a sufficient or estimable degree. In particular, if 
the errors of measurement are for all practical purposes constant then 
rather accurate knowledge of the true values of the items measured 
can be ascertained when the size of the constant error can be cali¬ 
brated by reference to a standard. On the other hand, if the errors of 
measurement vary appreciably from one observation to another in a 
series of observations, then the accuracy and usefulness of the re¬ 
ported results will depend largely on available knowledge concerning 
the relation between the errors of measurement and the true values of 
the characteristics or items measured. Thus, as the general demand for 
closer tolerances increases it becomes important to investigate methods 
for studying the relation between the two components of measurement 
in order to insure that the instrumentation involved meets its purpose 
adequately. 

The purpose of the present investigation, therefore, is to present a 
statistical study of methods for estimating and comparing product 
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variations and errors of measurement. The methods discussed are 
quite general and should apply to most iostrumentation set-ups which 
can be employed as outlined below. The problem itself arose at Aber¬ 
deen Proving Ground in connection with investigations of the ade¬ 
quacy of electric clocks in measuring burning times of powder train 
fuzes, in measurements of velocity with two or more chronographs, and 
in measures of rotating band clearance, to mention some.^ 

2. MBASUBEMENTS WITH A SINGLE INSTBUHEET 

Suppose a sin^e instrument is used to measure some characteristic 
of n different items. The results of measurement may be reprinted 
symbolically as follows: 

*1 + Cl 
Xa -(- cj 


*»• + Cl 


*«+ «n 

where xi, xa, • ■ • Xi, • • • x„ are the true values of the characteristics 
or the items measured and the quantities, ci, cj, ■ ■ * c<, * ■ ■ c» are 
errors of measurement. Generally speaking, Xi, xa, ’ ‘ • Xi, • ‘ • x» aie 
all different since there is variability in the items or product measured. 
Also, the quantities ci, ea, etc. will usually vary because a different error 
in measurement may be made on each item. It is to be noted here that 
in the rin^e measurement or observed value, the error of 

measurement, e{, is inseparable from the true value of the character¬ 
istic, Xi. That is, for example, a ^^e micrometer measurement may 
be, say, .5635 inch and it is not known precisely what part of .5635 
inch is the error of measurement, which in this case arises from the 
combination consisting of the operator and the micrometer. If all the 
errors of measurement ei, ea, etc. were zero it would be said that the 
instrument is accurate or perfectly calibrated for each measurement 
(or that the measurements are absolute) and also that the utmost in 
predsion or reproducibility has been attained. Precision here is de¬ 
fined as a measure of the variation in the errors of measurement. 


1 Somewhat aimilar pzoblemSt namely those of estimating components of yariance, are treated in 
Snedecor's book StatisUeal MdhoiSa Iowa State College Press, under the index heading *Vatiance, Com¬ 
ponents of* and in a paper by H. E. Danids *The Estiznadon of Components of Yaiiance,* Journal of 
ike Royid fSCotfsticaZ Soeieij/a Supp., Vol. Tl, Kb. 2,1939, pp. 18ft-197. 
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eiy e? 2 , etc. If the quantities Ci, 62 , etc. vary considerably it may be said 
that the measuring instrument lacks precision. On the other hand, if 
aU of the errors of measurement are equal, i.e. if ei=C 2 = • • * 

= • • * 6 », our instrument possesses the utmost in precision because the 
same constant error in measurement is made each time. However, such 
an instrument does not give absolute measurements unless corrections 
are made for the constant error of measurement. It is seen, therefore, 
that accurate knowledge of the true values or characteristics, X\y X 2 , Xz, 
etc. cannot be obtained unless somet hing is known about the errors of 
measurement, ci, ez, etc. 

If the measured or observed values Xi+ei, xz+ez, etc. are averaged, 
one obtains 

-= £ + c 

n 

where x is the average value of the characteristic measured and e is the 
average error of measurement. (As in the case of Xi+Ci above, the 
average observed value, x+e, consists of inseparable components also.) 
If actually, c=0, then it may be said that the errors of measurement 
average out to zero and that the average observed value would be 
“accurate”; but on the other hand, the measurements may not be 
precise since the individual errors of measurement may vary consider¬ 
ably from one observation to another. 

In order to determine the precision or reproducibility of measure¬ 
ment, the amount of variation in the quantities Ci, 62 , etc. should be 
examined, especially as compared to the variation in a;i, • • • x^. 
Now, the variance of the observed measurements is given by 

Ti {(*.• + e<) -(« + «)}* 

» - 1 <-1 

= —^ S - ») + (e« - e) }* 

= —53 (®i “ *)* "1-7 S “ 2) 

H — 1 t—1 U — 1 t»i 

+ i) (ei - 

W "" 1 *-=I 
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The quantity 


s** =« —^ 2 (x{ - i)*, 
n — 1 .-1 

which is based on a sample of n measurements, gives an unbiased 
estimate («—1 degrees of freedom used) of the true variation or vari¬ 
ance in the characteristic a: of a larger category from which the random 
sample of n observations is taken. In a like manner, 

»«- --7 (c,- — g)® 

» — 1 t-i 

is a measure of the variation in the errors of measurement and gives an 
unbiased estimate of the variance in errors of the measuring instru¬ 
ment. The quantity. 


1 ” 

8a« = ^ ^ C®i “ ®)(®» ” 2)> 

n - 1^1 

known as a co-variance, gives a measure of the relation between the 
values of the characteristics measured and the errors of measurement. 
In this discussion, it will be assumed that over a somewhat limited 
range in values of the characteristic, x, no correlation exists between 
the values of the characteristic measured and the errors of measure¬ 
ment, and consequently that the mean or expected value of the co- 
variance, s„, win be zero (see below). It may be true that errors of 
measurement in some cases increase with increasing magnitude of the 
characteristic measured; however, in many problems the x and e are 
suffidently independent to insure that limited variations in x are not 
reflected in the errors of measurement. 

If s,* is small (on the average), relatively speaking, then it may be 
said that the measuring instrument is precise; on the other hand, the 
larger s«®, the more impredse are the measurements. If is small, 
there is little variation in the characteristic measured: otherwise the 
variability in the charact^istic measured or product is not small. The 
efficiency with which the absolute values Xi, Xi, • • • ,x» are measured 
depends on (1) the relative size of s,® and s*®, the two components it is 
desired to estimate and (2) proper calibration of the measuring in¬ 
strument. An important point of interest here is that in uting a single 
measuring instrument on a group of items, the variance of the observa¬ 
tions eonelsts of the sum of inseparable components—one the variance 
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in the items measured and the other the variance in. the errors of meas¬ 
urement. 

With a single measuring instrument, therefore, it apparently is not 
possible (1) to separate an individual error of measurement, e„ from 
the value of the characteristic, a;,, (2) to separate the caierage error of 
measurement, S, from S, the average value of the characteristic meas¬ 
ured and (3) to separate the variance in errors of measurement from 
the variance in the characteristics measured in order that the relative 
magnitude of these components could be studied. It is true that by 
measuring the same characteristic over and over again vithout bias or 
if there is no variation in the product (i.e. if a:i=X 2 = • • • =*»), then 

could be obtained directlj'. Also, e could be determined if the value 
of * were known or if the instrument itself were calibrated accurately 
against a standard. 

The usefulness of a measuring instrument depends therefore on 
knowledge concerning the errors of measurement and a factor of pri¬ 
mary importance is the relative order of magnitude of and s**. Thus, 
methods are sought for separating the components, and The 
actual separation of s,® and s/ requires the use of at least two in¬ 
struments as pointed out below. First, however, certain definitions and 
assumptions will be ^ven. 


3. DEFINITIONS 

Let a:,=the absolute measurement or true value of the charac¬ 
teristic considered, where i represents the «th item in the 
sample 

e,/=the error in measurement of the fth item by the jth in¬ 
strument, I, 

The obsen>'ed values are then x, and the mean measure¬ 
ment, 

1 " 

X + e, = — + ®«j) 

n 


Taking 


s„v or ««., --- 22 (“» ~ «)(«'* ■" P)* 

n — 1 


22 («. - «)*• 


n — 1 


1-1 


it follows that 
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Define also the following expected values: 

E(xt) = n, E{eij) = «y 

E(Xi — m )* = E(eti — «,•)* = «■«/. 

Hence, 

4. ASSUMPTIONS 

We will assmne that re,- and ey,- are both normally distributed (al¬ 
though this is not necessary in so far as estimating variance only is 
concerned) and that the absolute values of the characteristic measured 
and the errors of measurement are statistically independent, i.e. 

E{x^-eif) = E(x,^)-E{eiit). 

In addition, it will be assumed that the errors of measurement of 
the differ^t instruments are also statistically independent of each 
other, i.e. 

= S(e<yP)-S(ett«) j 9 ^ k. 

5. THE SEPARATION OP S,® AND 8,® WITH TWO INSTRUMENTS 

If two instruments h and Is are used to make measurements on the 
same n items, the observed values can be represented symbolically as 
follows: 


Measurements 

Measurements 

by h 

hy Is 

Xl + Bii 

Xl + Ci* 

+ ca 

* 2+822 

Xi -f- en 

Xi + 8,1 

Xn + 

®n+ Sn, 


For the case of two instruments, there are two methods or computa¬ 
tional procedures for estimating predition of measurement. 
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As an estimate of the variance in errors of measurement of in¬ 
strument Ii, we take^ 

(1) est, ““ 

where 

1 “ 

s»+«i.»+e2 —-7 ^ {(aJt + c»i) — (5 + O} {(a* + e»2) — {x + h)} 

n — 1 ,«a 

1 ( ” 

= - -771^ (^1 + eti)(xti + 6 * 2 ) 

n{n - 1) I 

- [ 2 («* + «.l) j 2 («. + 

and as an estimate of the variance in errors of measurement of in¬ 
strument I 2 , we compute 

(2) est. (<r«2^) = S^a^4-«2 

The estimates ( 1 ) and ( 2 ) are unbiased, since, for example, 
est. (O = 5^2 ^ symbolically 

and 

E(se,^) = 

whereas 

= E(sx^) == E(seie^ = 0 Under the assumptions. 

Method B 2 : 

It is remarked in passing that a techrdque of working with differ¬ 
ences in corresponding measurements of the two instruments is of some 
interest. That is, if the measurements of 1% are substracted from cor¬ 
responding measurements of h, the result gives 

«11 — «12 
C 21 — «22 

6*1 “ 6*2 

6ni ““ 6n2, 


s EBtunates (1) and (2) are maximnm likelihood esiunates. 
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thus eliminating the true value of Xi, the characteristic measured. 
Using the above differences, an estimate of <re^ is given by 

(3) est. s^®+e 2 "b e 2 }* 

It is easy to see that estimate (3) is symbolically 


"b 


and hence precisely that given by (1). 

In order to estimate the amount of variability in the product being 
measured, we may first add corresponding measurements of the two 
instruments, obtaining symbolically 

Xi + eii +2:1+612 


Xx + ea + a:* + 6*2 


Xn + €nl + Xn+ €„,%• 


The variance of the above sums of readings is sjunbolically 

45*^ + + iSxei + 4Sjce2 + 

SO that an estimate of the variability in the product, is given by 

(4) est. ^“ei-eg}. 

Actually, however, the estimate of o-,® is given simply by the covariance 
of the readings of h and I 2 : 

(4a) est. (o-*®) = (4 and 4a are identical). 

Estimate (4) or (4a) is obviously unbiased and can be shown to be the 
maximum likelihood estimate. 

In order to determine the reliability of our estimates of precision in 
measurement, we examine the variances of the estimates. Now the 
population variance of (1) is given by 

^{est. (o-ei®) — CTei®}® = — (Tej*. 
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It is well-known* that we may ^vrite 

1 


= 


n(n — 1) 
2 


{{n — l)ju4:«i + (3 — 


» — 1 


fr ^ 


for a normal parent where /X 4 .ei is the fourth central moment of e,i- 
Also 

E{Sxe^) = 


n — I 


EiHeie?) = -- ffeiV,/ 

n — 1 

so that 

^{est. (<r*i*) — 

(5) 21, 

=-7 <rn* H- - 

» — 1 71—1 

likewise, the population variance of the estimate of o’*/ is given by 
Sfest. (v,j®) — 


( 6 ) 


-r H-r 

n — 1 7» — 1 


and the population variance of the estimate of product variability is 
given by 

Efest. (v.*) - <r,*}* 


(7) 


-- ff,* H-- 

7» — 1 71 — 1 


It is to be noted that the variance of the estimate, est. or est. 
(v«,®) depends on (a) v*®, the variance in the characteristic measured. 


* See, for escample, R. A. Fisher, "Moments sjid Product Moments of Sampling Distributions.*' 
Proeeedif^fa of the London MathemaUecd Society^ VoL 30,1020, pp 215-218. 
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(6) (Tq®, the variance of the errors of measurement of instrument h, 

(c) (Tq®, the variance of the errors of measurement of instrument h and 

(d) n, the number of observations or the sample size. It is seen, th^e- 
fore, that in order to obtain a predse estimate of <rq® when using only- 
two instruments, the variation in the characteristic measured, i.e. 

should be held to a reasonable Tninimum or the sample size, n, 
should be sufficiently large. 

If the variation in the characteristic measured is zero (or if we meas¬ 
ure the same item over and over again), i.e. if «r»*=0, then one could 
compute 


n — 1 _i 

directly, and this would give an estimate of o-q® with variance equal to 

2 


Apparently, in employing two instruments, there are only two com¬ 
putational procedures of interest for separating the variability in the 
product from the variance in the errors of measurement and both 
methods give the same estimate. In using either method, however, it is 
possible to estimate Vq®, <r«,® and o-*® and thus determine from the 
relative order of magnitude of these quantities whether the instruments 
are sufficiently precise to use in taking the required measurements. 

It is pointed out that the average observed value of h is and 
the av^age reading of /s is £+et, so that upon subtracting, Si—% is 
obtained, the average difference in the errors of measurement of 7i and 
Is. Of course, Si and Sa are inseparable here also and estimates of either 
depend upon calibration of h, h or both against a standard. 

6. THE SBPASATION OF S*® AND S,^® WITH THEBE INSTSUMENTS* 

If three instruments It, Is and la are used to make measurements on 
the same n iten^, the observed values can be represented as: 


* The qaestions of optunam estimates and efficiency of methods suggested below for three or more 
mstraments are not answered in this paper. It is believed, however, that the suggested technigpes show 
promise of providing good, workable estimates. It is hoped that a more complete investigation of this 
problem will be undertaken in the near futore mid that others* will interest themselves in this rather 
important study. 
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Measurements 
by Ji 

Xi + fill 
Xi + 521 


Measurements 
by J 2 

Xi + 612 
X 2 + 622 


Measurements 
by Is 

xi + ei3 
X 2 + 623 


Xn + 6nl Xn + e»2 Xn + Cna 

For the case of three instruments two techniques will be considered 
for estimating o-«/, and the variances in the errors of measure¬ 
ment of instruments 7i, I 2 , and Ja. 

Method Azi 

Two computational procedures giving rise to the same estimate will 
be given here. 

First, an estimate of is given by 

Also, in some particular applications it may be enlightening to 
eliminate the value of the characteristic measured by working with 
differences. That is, for each item measured, form the three columns of 
differences in observed values of the instruments taken two at a time. 
Thus, 


Differences in 

Differences in 

Differences in 

Readings, Ii — J 2 

Readings, Ii — Js 

Readings, I* — Is 

€11 + 012 

eu — ^13 

612 “ Cl 3 

621 — ^22 

621 ®23 

622 — €23 

®nl ““ ^n 2 

^nl ^n 3 

^ni ■“ ®n 3 


The variances of these three columns will be symbolically 

«* ” 2Se2fij 

Hence, as an estimate of take 
(9) est, ((Tei^) = 

Estimates (8) and (9) are identical, both being equal to 

symbolically. 
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Estimates (8) and (9) are imbiased and also free of the characteristic 
measured. 

As an estimate of form 

(10) est. “• Sx+ci,T^ei ~ S»+«s,»f«j 


or carry out a computation similar to (9). 

As an estimate of form 

(11) est. (ff,,*) = H” ®s5+<2.»+e8 


or carry out a computation similar to (9). 

The variance of estimate (8) or (9) is ^ven by 

J?{est. (<r^*) — = E{est. (<r^®)}® — 

=* .BI Sej® ®ei«j "1“ ®«2e8 } * 


In an analogous manner to that for determining the variance of an 
estimate in the case of t-vro instruments, it is easily seen that for three 
instruments 


^{est. ((Tq®) — o-*®}® 

( 12 ) 2 


n — 1 


+ 


re — 1 


(<»-«iV«j® + o-q®(r,,® + <r«,® 0 -,,®). 


likewise, the variance of the estimate (10) is 

(13) -- (ft* -- (<r,i®(r,.® + + v.2*o-.s*) 

re — 1 re — 1 


and the variance of (11) is given bj- 
, . 2 1 

(14) -- <r,,* -- (ff^®o-,,® + o-n*v«,® + <r,j*<r,,®). 

re — 1 re — 1 

Thus, the variances of the estimates of variation in errors of measure¬ 
ment obtained above for three inslanunents are entirely free of <r«*, the 
variability in the characteristic measured. Apparently, at least three 
instruments are needed to eliminate the characteristic x. 

Estimate (8) uting three instruments wiU be more precise than esti¬ 
mate (1) employing two instruments provided 

-f- <r,i®(ir,,* -b <r,5®<r«,® < er,®<rei® -f -f o-ei®<r*s* 

i.e. if ^ Vx®. 
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Since in general the variance in errors of measurement should be 
smaller than the variance in absolute values of the characteristic 
measured, it would appear that there is some advantage in the use of 
the additional measuring instrument, 7a for the purpose of estimating 

Method Bz:^ 

This technique involves the variance of the readings of h and the 
average of the three co-variances which can be formed from the three 
sets of observed values for instruments h, 1 2 , and la. 

Now the co-variance of the 1% and I 2 readings is 

= Sx^ + Sxei + + s^iet Symbolically. 

The co-variance of the h and Iz readings is symbolically 

+ Sxea + 

and that of the Ii and Iz readings is 

“h "f* "1“ 

The average of these three co-variances is symbolically 

“I" “h '3^»«2 ^ f *4“ ^Se2ez 

As previously stated, the variance of the observations with in¬ 
strument 7i is symbolically 

The latter variance minus the average co-variance gives as an 
estimate of <rei^ 

est. i “I” ” 1 ” 

(15) *= Sei^ “b §5*6, §^£«3 ‘ISciea 

“ -Iseaea Symbolically 

and the variance of this estimate is, in a maimer analogous to that 
previously indicated, given by 

-7 H- ^ 

(16) TO — 1 TO — 1 


* Mr. H A. Noble of the Ballistic Research Laboratory, using correlation coefficients, proposed a 
technique for estimating precisely equivalent to Method Bt and also su^^ested the problem of 
comparing the variance of Method Bs with that of Method At when three instruments are employed. 
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Recalling that Method Az for the case of three instruments gave an 
estimate whose variance is given by 

2 1 

-7 H-+ (TejVe,*), 

71—1 71—1 

it may be seen that Method Bz will not give an estimate of as pre¬ 
cise as that of Method Az unless 

2(cr^2a-es® + 

(Ta* < - • 

+ (Te,* 

Should it be assumed that then replacing and Ce^ 

by cTej® in the numerator and replacing ae^ by in the denomi¬ 

nator above, one arrives at the liberal inequality for o-*®: 



But even relaxing the inequality thus and irnder the assumption, 
then the estimate of Method Bz therefore will still not be as 
precise as that of Method Az unless 

< (Tea®. 

Generally speaking, however, it should be desirable for the precision 
of measurement or variance in errors of measurement (such as 
or to be considerably smaller than the variance in the charac¬ 
teristic measured, < 7 *®. The appropriate relation between v** and 
depends upon the purpose for which the measurements are taken and 
the cost of insuring that o-e/ is sufficiently small relative to or*®. 

To obtain an estimate of product variability when three instruments 
are used to measure the characteristic x, the average covariance of the 
readings of h, h and I 3 is computed. That is 

(17) ^ ,x+«2 H” ®»+ei.»+e* "b ®»+«s,ap+ea} 

= 5*® + f ®*ei + |®*ea + §5*68 + 

+ isetez symbolically 
with variance therefore equal to 

( 18 ) ;r^ '■*+;r^ 
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In case it may be desirable to list the average observed values of the 
characteristic measured, as in acceptance tests of products (see example 
at end of paper), then may be estimated by the following method 
which involves variances of differences in readings of the three in¬ 
struments. The mean reading of the three instruments on the ith item 
is symbolically 

f 6»2 + iCtS 

and the variance of such means is symbolically 

s*® + + l(s*ei + S*e2 + Sxeg) + -KSnea + + ®e2Cs)- 

Subtracting l/18th the sum of variances of differences in readings of 
the instruments taken two at a time, one obtains the unbiased estimate 

(19) est- (cr^B®) = “1” H" s*« 2 —ea}* 

If it were desirable to work with sums of the readings of Ii, h and I 9 
of each item, (19) may, of course, be rewritten as 

(20) est. (c**®) = -}-{ «2 “i" ea "t” ®*e 2 —cal } • 

It is remarked that (17), (19) and (20) are identical estimates of prod¬ 
uct variability. 

It may be noted that the average measurements of the individual 
instruments /i, h and Iz are 

5 + ^1, X + X + ez 

respectively and the differences in average errors of measurements of 
the instruments in pairs are 

61 — §2, 61 — 63, ?2 — 

which, of course, can be computed from the data ^ving rise to three 
equations and three unknowns 

= Cl 

61 — 63 = C2 

62 — 63 = Cs. 

However, the point of interest here is that although Ci, Cz and €3 are 
known from the data, the above simultaneous equations cannot be 
solved for the average errors of measurement Ci, h and h since the 
equations are obviously linearly dependent. Of course, the size of Si, 
€2 and ez may be determined relative to a standard. 
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7. THE SEPARATION OF AND 5®/ WITH POUR INSTRUMENTS 

K four instruments h, J 2,13 and J 4 are used to make measurements on 
the same n items, the observed values or readings of the instruments 
can be represented as follows: 

Measurements Measurements Measurements Measurements 
byZi by 72 by Is by 74 

Xi + 611 Xi + 612 Xi + ei3 Xi + ei4 


X 2 + 621 


X 2 “f” ^22 


X 2 + ^23 


X 2 + ^24 


Xn + Cnl Xn + Bn2 


Xn "I" ^nS Xn ^ni 


From the discussion so far, it is apparent that and tr*® may be 
estimated by considering the readings of h with any one of the in¬ 
struments 72, Iz and li —similarly for estimating and <re/. Also, 

the variances in errors of measurement and variance in the charac¬ 
teristic measured may be estimated by considering the readings of any 
three of the four instruments at a time. These two problems have al¬ 
ready been treated above. It remains, therefore, to consider simultane¬ 
ously the results of measurement of all four instruments and to com¬ 
pare the precision of estimates thus obtained with that for the case in¬ 
volving three instruments. 

Method A 4 : 

Using the variance of the readings of h and the covariances of the 
readings of the four instruments taken two at a time, an estimate of 
(Tei is given by 

* 4 " "b ^«+«s,s+e 4 "b 5*4.83, 

On the other hand, by forming the six columns of differences in read¬ 
ings of the four instruments taken two at a time, an estimate of or^i* 
is given by 

( 22 ) est. { 5‘'ei_82 "4"es *4“ 5 * 81—84 ^ [5*ej—ej "t" 5 * 82—64 “4" 5 * 83—84 ] } . 

Estimates (21) and (22) are equivalent and symbolically equal to 

~ I(5ei«2 + 56182 + 58184) + i(5826j + 58264 4" 58 , 64 ). 

Consequently, the variance of estimate (21) or (22) is 



ESTIMA-TING PHECISIOIJ- AND VABIABIUTT 


259 


(23) » - 1 ■'« + 

+ + <r.,v,«*)}. 

Estimates of <r,,® and by the above procedures and the vari¬ 
ances of such estimates can be formed simply by rotating the sub¬ 
scripts «], 62,6s and 64 in (21) or (22) and (23) appropriately. 

The estimate (21) or (22) of and its variance (23) are entirely 
free of x, the characteristic measured, as is the estimate (8) or (9) em- 
plo3dng three instruments. Estimate (21) or (22) using four instruments 
will be more precise than estimate (8) or (9) using three instruments 
provided 


ie provided 


5<r„*<r«,® -1- 5<r<,®(r.,* + 8(ra,®<r,,® 

’■«4 < 

4vai® ■+• (Taj® -|- 0-,,® 

This condition appears easy to meet, since, for example, if 
o-a,®=<r,®=va,*=ff*, then one has only to be sure that 

Vaa* < 3tr®. 

Moreover, if it is assumed that vai®^va,®^v^,® then estimate (21) 
will surely be more precise than estimate (8) or (9) so long as 

j 5(ra.*vn® + 8<r„®v..* + 5v..»Va,* ^ 3V 
^ 4ff,,* + <r,,* + <r,,® ” ffa,® 

or if <r,a® < 3<rei®. 

It is believed that this condition may be met easily in practice. Con¬ 
sequently, the use of an additional measuring instrument wiU appar¬ 
ently decrease the variances of the estimates of predaon of measure¬ 
ment. 


8. THE SEPABATION OF 8,® AND 8,® WITH N INBTETJMBNTS 

Defining 6,, as before to be the error in measurement of the (th item 
(i=i, ;?,•••,«) by the yth instrument, it is easy to extend the above 
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results to any number, N, of instruments which are employed to meas¬ 
ure the items, Xi. An estimate of the variance in errors of measurement 
of instrument h is for N^3 given by 


(24) 


eSt. (cTei®) — ^oH-ei.aH-eA 

AT — 1 I r-2 ) 

+ W-»W-2){ 


which is symbolically equal to 


2 _ 


9 N 


Jfe-JV 


«l«r 


+ 


(N -DiN- 2) 




SO that the vaiiance of (24) is ^ven by 

2 1 ( 4 w 

—7 -7 ITw—7« ^ "■•iV* 

— 1 n — 1 — 1)* 


(25) 


+ 


4 


(iv - i)*(iv - 2)* 2i;kk 


2^j<fc J 


Formulae for estimates of <re^f etc. and variances of such estimates 
may be foimd by obvious rotations of the subscripts. 

The estimate of variability in the product is taken as the average of 
the covariances, i.e. 


est. (ffaf-) = 


(26) 


NiN - 1) 




2 ^ 2 
xr ^ TiTf-KT " 

N r=l N(N - 1) l^r<. 


(N 2) 


53 symbolically 


so that the variance of estimate (26) is given by 


w — 1 




4- 




n - 1 {N^ 




H 0-.r® 


NHN - 1)* 




(27) 
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Should it be desirable to work with differences in errors of measure¬ 
ments of the instruments taken two at a time, an estimate of is 
given by 

(28) est. M S 

N - 1 N - 2 2S3<k J 

Estimates (24) and (28) are identical and hence have the same variance. 

Using the average measurement of the N instruments on x,* and differ¬ 
ences in errors of measurement, an estimate of product variability is 

1 ^ 

(29) est. (cTx^) = s^»+(i/i\r)(ci+«2+* • •+eu) 771777 77 S sV-e«* 

N^iN - 1) lgr<. 

On the other hand, using the sum of the measurements of the N 
instruments on Xf and columns of differences in errors of measurements, 

(29) may be rewritten as 

If 

(30) est. {(Tx^) = “S J\raH-ei+<2+ • • •+«« 77 7 £ f • 

I N - 1 lSr<» ) 

It is easy to show that (29), (30) and (26) are identical estimates. 

9. EXAMPLE ON ESTIMATING PRECISION OP MEASUREMENT 
AND PRODUCT VARIABILITY® 

In Table I there are listed the individual burning tunes of powder 
train fuzes as measured by each of three observers on 30 rounds of 
ammunition which were fired from a gun. The fuzes were all set for a 
burning time of ten seconds. 

The burning time of a fuze is defined as the interval of time which 
elapses from the instant the projectile leaves the gun muzzle until the 
fuze functions the projectile. The times given in Table I are measured 
by means of electric clocks. A switch on the gun muzzle starts three 
different electric clocks as the gun is fiired and each observer stops his 
clock the instant he sees the flash or burst of an individual round. Each 
timer, of course, stops his clock independently of the other two timers. 
In Table I is given also the mean time of the three observers on in¬ 
dividual roimds. The average time-to-burst of the 30 fuzes is taken as 
the average of the 30 mean times with the effect of reaction time (which 
is known accurately) subtracted therefrom. The problem here is to 


* The data for this example are given also in Colonel L. E. Simon’s paper, *On the Rdation of 
Instrumentation to Qnality Control,” iMtrumenU, voL 19, November, 1946. 
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TABLE I 



Fuze Bumins Times 


Mean Times 


(seconds) 


(seconds) 

Observer 

Observer 

Observer 


A 

B 

C 



10.10 

10.07 

10.07 

10.080 

9.98 

9.90 

9.90 

9.927 

9.89 

9.85 

9.86 

9.867 

9.79 

9.71 

9.70 

9.733 

9.67 

9.65 

9.65 

9.657 

9.89 

9.83 

9.83 

9.850 

9.82 

9.75 

9.79 

9.787 

9.59 

9.56 

9.59 

9.580 

9.76 

9.68 

9.72 

9.720 

9.93 

9.89 

9.92 

9.913 

9.62 

9.61 

9.64 

9.623 

10.24 

10.23 

10.24 

10.237 

9.84 

9.83 

9.86 

9.843 

9.62 

9.58 

9.63 

9.610 

9.60 

9.60 

9.65 

9.617 

9.74 

9.73 

9.74 

9.737 

10.32 

10.32 

10.34 

10.327 

9.86 

9.86 

9.86 

9.860 

10.01 

lost 

10.03 

10.020 

9.65 

9.64 

9.65 

9.647 

9.50 

9.49 

9.50 

9.497 

9.56 

9.56 

9.55 

9.557 

9.54 

9.53 

9.54 

9.537 

9.89 

9.89 

9.88 

9.887 

9.53 

9.52 

9.51 

9.520 

9.52 

9.52 

9.53 

9.523 

9.44 

9.43 

9.45 

9.440 

9.67 

9.67 

9.67 

9.670 

9.77 

9.76 

9.78 

9.770 

9.86 

9.84 

9.86 

9.853 


determine whether the electric clock is a satisfactory instrument for 
measuring burning times, provided the electric clocks are properly 
calibrated with regard to av^age time so that the systematic error of 
measurement may be reconciled. This question is answered by com¬ 
paring the variance in errms of measurement with the variance in 
burning times of the fuzes. 

In Table II there are given the algebraic differences between times 
of the three observers A, B, and C on each roimd. 

It is to be noted that any one of the differences listed is not influenced 
by the level of burning tame of an individual fuze and represents the 
actual difference in errors of measurement of the two observers in¬ 
volved. Thus, the variance of each of the columns headed A-B, B-C, 
and A-G, ^ves the variability of the difference in errors of measure¬ 
ment of the two observers compared. Therefore, taking the variances 
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TABLE n 

DIEFEBENCE BETWEEN OBSEaVEBS’ TIMES 
(SECONDS) 



B-C 

A-C 

+ .03 

0 

+.03 

+ .08 

0 

+.08 

+ .04 

-.01 

+.03 

+ .08 

+.01 

+ .09 

+ .02 

0 

+ .02 

+ .06 

0 

+.06 

+.07 

-.04 

+.03 

+ .03 

-.03 

0 

+ .08 

-.04 

+.04 

+.04 

-.03 

+ .01 

+.01 

-.08 

-.02 

+.01 

-.01 

0 

+.01 

-.03 

-.02 

+.04 

-.05 

-.01 

0 

-.05 

-.05 

+.01 

-.01 

0 

0 

-.02 

-.02 

0 

0 

0 

— 

— 

-.02 

+.01 

-.01 

0 

+.01 

-.01 

0 

0 

+.01 

+ .01 

+.01 

-.01 

0 

0 

+.01 

+.01 

+.01 

+.01 

+ .02 

0 

-.01 

-.01 

+.01 

-.02 

-.01 

0 

0 

0 

+ .01 

-.02 

-.01 

+ .02 

-.02 

0 


of the three columns in Table II, the following three equations are 
arrived at; 

s*«-«s = .0007030 sec.® 

(a) = .0008878 sec.® 

= .0003108 sec.® 

where Si, sj and es represent the errors of measurement of the observers 
A, B, and C respectively. 

Using formulae similar to (9) of the paper for estimates of <r^®, o-,,®, 
and <r«,® it is found that 

v«i = .0253 sec. 

(b) <r»t =* .0079 sec. 

(r«, = .0157 sec. 
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The above figures give a direct comparison of the ability or precision 
of each of the observers. It is seen that observer B is perhaps the best 
of the three since his precision of measurement is given by a standard 
deviation of only .0079 second, whereas observer A is the poorest 
timer of the three. As a matter of fact, one can look at the columns in 
Table II headed A-B and A-C and note that for the first 10 roimds or 
so observer A had a definite lag in stopping his clock as compared to 
the other two observers. 

An estimate of d is obtained by using formula (19), i.e. by subtract¬ 
ing l/18th of the sum of the variances of differences (a) from the 
variance of mean times in column foiu* of Table I. Thus, 

est. (ff.*) = .046098 - *{.0007030 + .0008878 + .0003108} 

= .04599 
or 0“* .2145. 

Thus, it is seen that the variation in burning times, o-*, is about 8.5 
times the variance in errors, (r«i, for the poorest operator and about 12 
times the average variability in errors of measurement of the three 
operators combined. Consequently, it is found that the combination of 
electric clocks and operators provides an adequate measuring instru¬ 
ment. 

10. SIGNIFICANCE TESTS* 

The problems of significance tests for comparing a-*®, etc. 

and judging whether ci—€2, ei—€3, • • • , ft—€3, ■ ■ • , differ 

significantly from zero need to be investigated, although certain rough 
or approximate tests may be evident to the reader. In this connection, 
it is remarked, for example, that for the case of three instruments if 
and ffe,- are of the same order of magnitude, then the approxi¬ 
mate number of degrees of freedom for the est. est. or est. 

is 1(72 — 1). However, the reader is warned that questions of de¬ 
pendence among the estimates need to be investigated since they may 
have an important bearing on significance tests. It is hoped that an 
investigation of these problems can be undertaken in the near future.** 


* While this paper was processed for publication, the author developed sequential probability 

ratio tests for the use involving two instruments. 

** The author is indebted to Professor W. G. Cochran and referees of this paper for valuable sug* 
gestions in preparation of the manuscript. 




LIFE COMPANY INVESTMENTS AND THE CAPITAL 

MARKETS* 


Febgus J. McDiabmib 
The Lincoln National Life Insurance Company 

1. Savings made through life insurance are at this time the 
most common by far of all types of individual savings. Life 
insurance accumulations at the present time account for some¬ 
thing around one-fourth of the total savings of individuals. 

2. Savings made through life insurance are a very stable 
form of savings, varying less with business conditions than 
other forms of monetary savings. 

3. At the present time, life insurance savings form by far 
the most important source of capital which seeks mainly a se¬ 
cure rate of return and safety of principal, rather than tax ex¬ 
emption or chance for capital appreciation. Traditionally they 
have been invested very largely in evidences of long term debt. 

4. In the decade prior to the war, life company funds seemed 
to be outgrowing traditional investment outlets. 

5. However, at the present time this trend seems to be in 
reverse, with investment outlets, even in traditional form, be¬ 
coming rather plentiful. With declining savings, higher prices, 
and a huge real capital demand both here and abroad, we seem 
to have entered a period of real capital scarcity. 

6. The main new outlet developed in recent years for life 
insurance funds has been direct investment in real estate, both 
housing and commerciaL 

7. The tendency for life insurance companies to invest 
largely in debt media may create a problem from the point of 
view of some industries which should be financed largely 
through equities. 

GROWTH OF LIFE INSURANCE ASSETS 

T he assets of life insurance companies domiciled in the United 
States passed the $50 billion mark late in 1947, and they are in¬ 
creasing at a net rate in excess of $3 billion a year. This represents a 
doubling of the prewar rate of asset increase, a rate which still lags 
somewhat behind the increase in the national income over that period. 
Not only does this building up of life company assets create an invest¬ 
ment problem of great magnitude, but it has an important influence on 
the capital markets in determining the types of securities through which 
capital can most readily be raised. Also the acceptability of an industry 

* A paper presented at the 107th Annual Meeting of the American Statistical Assoriation, New 
York City, December 30,1947. 
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as a medium for the investment of life insurance funds may determine 
the relative ease with which it can raise capital in any fonn. 


LIFE INSUEANCB FUNDS IN EBLATION TO TOTAL SAVINGS 

Just how importmit is the flow of monetary saving throng the life 
insurance companies, in relation to total savii^ by our people? The sum 
total of saving by individuals has been estimated by the Department of 
Commerce and the Securities and Exchange Commission. Elach year’s 
savii^ represents the sum of a number of items, the main ones being 
the increase in holdings of currency and bank deporits, the increase in 
insurance and pension reserves, the increase in security holdings, in¬ 
cluding XT. S. Savings Bonds, the increase in the net equity in dwellings, 
and the decrease in consumer debt. 

In the boom year 1929, the increase in life insurance reserves ac¬ 
counted for over 40 per cent of the total estimated net savings of in¬ 
dividuals. During the decade of the thirties, the life insurance reserve 
increase slowed down some; but it constituted a rising proportion of 
total savings of individuals, averaging about three-fourths of the total 
estimate of these during that period. During the war years, the total 
of individual saving was greatly swollen; and life insurance accumula¬ 
tions, althou^ rising, did decline as a proportion of total savings to 
less than 10 per cent of the total estimate in 1944. Since they reached 
their wartime peak in that year, net savings of individuals have de¬ 
clined very sharply, while life insurance assets have continued to grow. 
As a result, life insurance accounted for 22 per cent of the total saving 
flow in 1946. Because of the continued decline in the total of new saving 
by individuals, probably because of high living costs and the greater 
availability of durable consumer goods, life insurance funds probably 
will increase as a proportion of the total in 1947 and may very well 
account for at least one-fomrth of total individual saving. 

Certain features of the American economy tend to increase the rela¬ 
tive importance of savings made throng life insurance. It has become 
an increatingly urbanized economy and one of continuously greater 
specialization. To an increaring extent, the tools of production are 
financed and owned on a collective basis rather than throu^ direct 
ownership by individuals. An expanding proportion of saving^ takes 
place throng monetary channels rather than through the direct 
ownership of property. 

Also, because of ascendii^ income tax schedules on high incomes and 
the enlaigement of the middle class group from below by hi^er real 
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wage levels, a large and increasing proportion of total savings is now 
derived from people in the middle income brackets. This is the group 
which probably makes the largest proportional part of its savings 
through life insurance rather than through the direct ownership of 
securities and property, including equities in businesses. 

There is still further evidence of the importance of life insurance as 
a means of saving. According to a study of consumer savings in 1946, 
published in the Federal Reserve BiMetin for August, 1947, the payment 
of life insurance premiums constituted by far the most common form 
of individual savings. Out of every 100 spending units in the country 
that year, 76 saved through the payment of life insurance premiums, 
18 saved through pa 3 nnents on mortgages, and 13 by payments into 
retirement funds. These figures are not mutually exclusive. The point 
I wish to make is that, as a means of popular saving, life insurance led 
the parade by a very wide margin, the number of spending units saving 
through life insurance being not greatly exceeded by the number of 
such units saving through all other media combined. 


NATURE OF UFE INSURANCE SAVINGS 

Savings made through life insurance have certain noteworthy char¬ 
acteristics. First of all, they are relatively stable—^much more so than 
the sum total of individual savings combined. The reasons for this can 
be readily surmised. As lon^ as a life insurance policy is kept in force, 
the accumulation of a reserve becomes an automatic aSair; and, in 
general, people tend to keep their life insurance in force even if they 
have to forego temporarily other types of savings. In brief, the institu¬ 
tion of life insurance has tended to make the accumulation of monetary 
savings more of a continuous and automatic process than it would be 
otherwise. 

Another characteristic of life insurance savings is the relatively ciiv 
cumscribed scope for their investment. By law and tradition, the 
investment of these has been limited largely to senior securities, which 
means very largely evidences of debt. In other words, their investment 
has been restricted veiy largely to bonds and real estate mortgages. 

There are a number of reasons for this state of affairs. In the first 
place, life insurance assets have been traditionally regarded as trust 
funds and their investment in the most senior and secure inv^tment 
media has seemed proper. 

Second, the mathematical calculation of life insurance reserves as¬ 
sumes a fixed but certain rate of return, and it has seemed proper that 
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these reserves should be backed by assets which appear best able to 
assure this type of return. 

Third, the above two considerations have been reinforced by law. 
The laws of most important states specify that life insurance funds, 
aside from certain limited proportions, should be invested in evidences 
of debt. 

Fourth, regulations governing the valuation of life insurance assets 
tend to favor investment in bonds and mortgages. While well secured 
bonds and mortgages may be carried at cost or at amortized values, 
stocks, both preferred and common, must be carried at current market 
values. This tends to limit equity investments to a low percentage of 
assets because of the possible effect on the company's surplus of a seri¬ 
ous decline in stock market prices. As a practical matter, therefore, life 
company investment in stocks rarely approaches the legally permitted 
limit, and seldom exceeds about 5 per cent of the assets of an individual 
company. 

Because of the quite rigid manner in which the liabilities of a life 
insurance company are calculated, security of principal and reliability 
of rate of return are more important considerations than the chance for 
capital appreciation. Also, because of the manner in which life com¬ 
panies are taxed, tax exemption of income from securities is of very 
little value to them. In these respects, their position differs radically 
from that of wealthy individuals in high income tax brackets, to whom 
the chance to substitute a capital gains tax of 25 per cent for the 
regular income tax schedule is a prime consideration. The position of 
life companies with respect to tax exemption of securities is consider¬ 
ably different from that of the commercial banks, whose earnings are 
taxed on the same basis as those of corporations in general. 

Because of these factors, life companies are mainly buyers of types 
of securities which have largely lost their appeal to individuals, namely, 
relatively high grade corporate bonds and mortgages. While there is 
some overlapping of interest between the life insurance companies and 
commercial banks, in general their requirements are different. Banks, 
to provide backing for their demand liabilities, invest largely in short 
term securities; while life companies, because of the steady growth in 
their assets and large and comparatively reliable cash inflow, do not in 
general feel a pressing need to sacrifice rate of return for liquidity. Also, 
tax exemption in a security has real value to a commercial bank, while 
in recent years the life companies have largely disposed of their tax 
exempt municipal bonds. However, because of sharply higher yields 
on municipal bonds, the life companies are again beginning to buy 
them on a small scale. 
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As a result of these factors and also because of the very large volume 
of funds at their disposal, the life companies have become the most 
important source of long term capital in this country of the type 
which is interested primarily in security of principal and sureness of 
rate of return. They have replaced the individual investor in this role 
since the last great period of capital expansion in this country—^that 
of the twenties. 

The relative importance of life insurance funds in the economy and 
the restriction, legal and otherwise, placed upon their investment, have 
given rise to a good deal of discussion and to some action, as we shall 
see later. 


SUPPLY AND DEMAND FACTOES 

For a long period in the history of this country, up to about a dozen 
years ago, the fact that life companies were largely restricted to in¬ 
vestment in debt securities did not seem to create any serious problem. 
Up to the year 1930, at least, the growth in life company assets was 
approximately matched by that in long term private debt, consisting 
of corporate bonds and real estate mortgages. Up to that time, such 
private indebtedness had been the traditional channel of life insurance 
company investment, and in 1930 accoimted for 84 per cent of the total 
invested assets of the companies. 

But whereas in the decade of the twenties a $12 billion increase in 
life company assets was matched by a $30 billion increase in long term 
private debt, during the fifteen years beginning with 1931, life com¬ 
pany assets increased $26 billion while private debt declined by $22 
biUion. Or, putting it a little differently, in 1930 life company assets 
equalled about one-fifth of the total long term private debt outstand¬ 
ing, while at the end of 1945 they were equal to nearly two-thirds of 
the total of such debt. 

The point is that during this latter period, the traditional forms of 
investment media did not prove adequate to absorb the increase in life 
company funds. During this period, the life companies took over an 
increasingly large proportion of the total outstanding private debt. 
At the present time, for example, they own over one-half of the total 
bonds of the electric, telephone, and gas divisions of the public utility 
industry. 

In the years just prior to the war, the life companies purchased a 
substantial volume of U. S- Government bonds, largely for want of 
other investment outlets; and during the war, their holdings of these 
bonds mounted to reach a total of nearly one-half of the total assets 
of the companies. During the war period, the 3 deld on long term U. S. 
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Cxovemment bonds of 2§ per cent compared favorably 'vrith a yield of 
less than J per cent more on hi^ grade long term corporate bonds. 
This spread in yields has since about doubled, the yield basis on which 
long term high grade corporate bonds can be sold in quantity being at 
least § per cent higher than that on longest U. S. Government bond 
issues. 

INVESTMENT IN BEAL ESTATE 

Probably the most noteworthy attempt to expand the investment 
field for life company funds was that into the direct ownership of real 
estate. Such real estate investments take two main forms—Chousing 
development and the ownership of business properties. The latter 
are usually leased to corporations of satisfactory credit standing on a 
basis to give the life company a moderate return and amortization of 
the investment over an extended period of years. 

In recognition of the need for additional investment outlets, as well 
as the social desirability of permitting life company funds to flow 
directly into medium-cost housing (there is no low-cost housing any 
more), tiie laws of a considerable number of states have been amended 
to permit such investment. At the present time, the law of New York 
State, where ^fe companies having about 60 per cent of the total assets 
of the industry are domiciled, permits investment of up to 10 per cent 
in housing projects built by the companies, and an additional 3 per 
cent in business or residential real estate. New Jersey, also an impor¬ 
tant state life insurance-wise, permits an investment of 5 per cent of 
assets in real estate held for business or residential purposes. Massa¬ 
chusetts currency permits 10 per cent of assets in rental housing and 
10 per cent in other real estate. Coimecticut and Wisconsin have catch¬ 
all provisions in their laws which permit investment in real estate or 
other investment media not otherwise authorized. Indiana has a law 
which permits investment in commercial real estate, subject to its 
lease to a corporation of defined credit standing, and in residential real 
estate. 

THE PRESENT LABOE CAPITAL DEMAND 

In the quite recent past, many people were fearful as to how far 
the lack of balance between supply and demand would go in depressing 
yields available for fixed interest securities of the type in which life 
companies had traditionally been invested. Today, for the time being 
at least, such fears are somewhat abated. A good many of us failed to 
foresee the huge postwar demand for capital and the dSect of inflation 
in stimulating that demand. The rise in prices has had the effect of re- 
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quiring from $1.50 to $2.00, or even more, to the work formerly done 
by $1.00. 

Take the case of the utilities. From reading prospectuses, tihe im¬ 
pression is gained that many electric companies will make additions to 
their plants in the five years beginning with 1947 which will cost in 
dollars around 50 per cent of the cost of their existing plants. It costs 
about two-thirds more now than before the war to install a kilowatt 
of steam generating capacity. 

American Telephone & Telegraph is currently engaged in a construc¬ 
tion and financing program which will add about 40 per cent to its 
recent capitalization. 

Many manufacturing and mercantile concerns have suddenly awak¬ 
ened to the fact that they need a lot of new money and need it rather 
badly—so badly that they are not in too good a position to aigue about 
terms. It costs a lot more to carry inventory at present prices, and some 
inventories are imbalanced and a bit slow movii^. 

The recent demand for mortgage money, infiated by high real estate 
prices, has been particularly great; and some mortgages which were 
eagerly sought for a year ago now go abegging. Some life insurance 
companies have found themselves overly committed in their mortgage 
acquisitions and have practically withdrawn from that field. 

Apparently, what we are witnessing is a sharp rise in tiie demand for 
investment capital, coupled with a continued decline in savings. High 
prices have probably reduced the ability of people to save. In any case, 
the interest on h%h grade long term bonds has increased by over § per 
cent in the last If years, and the trend is upward. It currently appears 
that there will be a very large supply of such bonds sold for new mon^ 
purposes over the next few years. Also, a yield of 3 per cent, which 
looked pretty bad to the institutional investor on the downside, now 
looks pretty good going up. With greener pastures available in their 
traditional investment fiel^, there is a greatly lessened incentive for 
life companies to seek new investment channels at this tune. 

BBOIC THE CAPITAL BAISBE’s POINT OP VIEW 

So far we have looked at this matter largely from the point of view 
of the life insurance investor. We will now attempt to look at it briefly 
from the point of view of the party requiring capital. Take the public 
utility industry as an example. It is perhaps the prime example of an 
industry requiring a large amount of funds of the type supplied by the 
life companies, that require a reasonably steady rate of return with 
capital appreciation as a quite secondary consideration. 
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It has come to be regarded as proper by the regulatory authorities 
that a utility should raise not much over half of its required capital in 
the form of debt and the balance by the sale of equity securities— 
preferred and common stocks—or by retained earnings. As long as they 
their credit position, the utilities can probably raise capital 
by the sale of these bonds to life insurance companies. But what about 
balancing the sale of these bonds by that of equity securities on which, 
among other things, the maintenance of this credit position depends? 
During the last three years, the life companies have bought a rather 
large volume of utility and other preferred stocks. Recently, however, 
they have turned extremely cold toward this class of security, partly 
because some life companies have about reached their limit in this 
field, due to the reasons previously outlined, and partly because of the 
quite bad recent market bdxavior of these securities. Life companies 
have never been substantial buyers of utility common stocks. 

Because of this situation it seems likely that the utilities are going 
to have trouble balancing their bond sales with stock sales unless they 
are prepared to do their stock financing on a relatively high cost basis. 
In addition, recent events have proved that they must also increase 
the attractiveness of their preferred stock financing by the insertion of 
sinking funds or convertibility features. They may even be forced to 
substitute convertible debentures for equity securities in their capital 
structures as far as the raisii^ of new money is concerned. The sub¬ 
stantial withdrawal of the life companies from the preferred stock 
market has made clear the extent to which they have rather recently 
come to dominate that market. Their dominance of the corporate bond 
market has been clear for at least a decade. 

The sale of bonds has not been the traditional way for most branches 
of American industiy to raise capital. In the past, only a few branches 
of our industiy—^the most important of which are steel, rubber, meat 
packing, and tobacco—^have raised any appreciable part of their capi¬ 
tal throu^ the sale of bonds. On the other hand, some leading divisions 
of industry—^inoluding chemicals, electrical manufacturing, and many 
others—have been financed largely through equities. A good many 
types of industiy, because of the relative irr^ulaiity of their earnings, 
would probably be weU advised to avoid the use of contractual obliga¬ 
tions for raising capital, if at all possible. However, the relative ease 
with which the funds of life companies can be tapped through the debt 
route, compared with the very great difficulty of tapping them throi^ 
any other route, may put pressure on some industries to sdl bonds when 
preferred or common stocks would be more suitable. 
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The emergence of the life companies as the main suppliers of fixed 
return capital in this country has had a major influence on the under¬ 
writing and distribution of securities. The selling of securities in large 
blocks to life companies, plus competitive bidding, have caused new is¬ 
sues to be handled on very small gross profit margins to underwriters, 
which would have been impossibly small if sale and distribution had 
to be made to a host of individual buyers. In view of the current 
attitude of the life companies toward preferred stocks, these may again 
have to be sold to individuals; the gross profit to underwriters at which 
they are handled will have to widen materially. The coupon rates also 
will have to be increased materially above recent levels to be attractive 
to individuals in substantial income tax brackets. 

It is a well-known fact that keen rivalry exists among the large life 
insurance companies in the making of investments. They make their de¬ 
cisions on a highly individualistic basis and frequently pursue divergent 
investment policies. This, however, does not obviate the fact that imder 
present conditions, success or failure in the xmderwriting of corporate 
bond issues of any size depends largely upon the willingness of at least 
some of the largest life companies to buy substantial amoimts. Not 
only does this serve directly to get the bonds sold, but it no doubt in¬ 
fluences the action of smaller investing institutions. 

The dominance of the corporate bond market by the life companies 
has, of course, led to the private deal. It is probably natural that if a 
group of life companies are going to take most of an issue anyway, 
they should band together to shape the terms as much as possible to 
their liking, and as a secondary consideration save underwriters’ 
profits. From the point of view of investing institutions, the advantages 
of such deals are that stronger protective terms can be imposed than in 
corresponding public issues, and a little better yield obtained. 

From the point of view of the borrower, advantages include the 
avoidance of SEC registration, the chance to deal with only a limited 
number of professional investors in case a change in the terms is later 
desired, and the certainty of sale. Often an investment banker acts as 
agent in presenting the deal. In the case of a small issue of securities, 
particularly if put out by a borrower of secondary or unestablidxed 
credit standing, the saving to the borrower of going the private route 
may be very great indeed. 



MORTALITY STATISTICS FOR LIFE INSURANCE 
UNDERWRITING* 

Edwabd a. Lew 

Metropolitan Life Insurance Company 

The paper deals chiefly with those mortality statistics, com¬ 
piled from the experience of insured persons, that have been 
used as a guide in life insurance underwriting. Properly inter¬ 
preted, these statistics shed considerable light on.jbhe factors 
influencing human longevity. The use made of other mortality 
statistics in the underwriting of life insurance risks is also indi¬ 
cated. 

L ife insurance companies have conducted many investigations of 
mortality to obtain the statistics needed for the calculation of pre¬ 
miums, as a guide in the selection of risks, and for other purposes. 
Aside from serving their primary purposes, these studies contain a 
wealth of infoimation regarding the factors influencing human longev¬ 
ity. In order to interpret the results intelligently, it is necessary to 
know something about the particular purposes of the various studies, 
the methods used in the compilation of the statistics, and their limi¬ 
tations. 

This paper is confined to a consideration of the mortality statistics 
for life insurance underwriting. The term “life insurance underwriting” 
is applied to the procedures used by life insurance companies in pass¬ 
ing upon applicants for life insurance. Broadly speaking, the objective 
of these procedures is to appraise applicants’ prospects of longevity so 
as to classify them either into the various acceptable categories that 
have been established or the t3npe which should be rejected, and to 
determine the appropriate premium rates for applicants in the accept¬ 
able cat^ories. 

The aims of this paper are: 

(1) To sketch briefly how mortality studies for underwriting began 
in this country, 

(2) To indicate the extent to which underwriting considerations 
have shaped both the form and content of the mortality studies 
for life insurance underwriting, 

(3) To review the methods used in the more important investigations 
made by life insurance companies to obtain information for 
underwriting purposes, 

* A paper presented at the 107th Annual Meeting of the American Statistical Association, New 
York City, December 30,1947. 
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(4) To consider the principal findings of these investigations in the 
light of the limitations of the data, and 

(5) To examine the usefulness for underwriting purposes of mortality 
statistics from sources other than the experience of insured lives. 

ORIGIN OP MORTALITY STUDIES FOR UNDERWRITING 

The efficacy of life insurance undeirwriting depends lai^ely on knowl¬ 
edge regarding the effects on mortality of those characteristics of life 
insuranco applicants which can readily be ascertained in practice. That 
detailed quantitative knowledge is not absolutely necessary is indicated 
by the earlier successful record of the life insurance business, first in 
Great Britain and later on this continent. Even in the first part of the 
eighteenth century, there was sufficient qualitative understanding of 
many factors affecting human longevity, based on common sense ob¬ 
servations, to have led some English companies to charge extra pre¬ 
miums to travelers going to Ireland, to women of child-bearing age, and 
to persons who had not had smallpox. Generally speaking, however, life 
insurance underwriting before the advent of mortality statistics was 
primarily concerned with eliminating applicants who were in poor 
health or otherwise undesirable risks on account of intemperate habits, 
questionable morals, etc. 

More exact knowledge of mortality, based on statistical investiga¬ 
tions, had to wait until sufficient data had been accumulated and suit¬ 
able methods for extracting the information devised. That the data be¬ 
came available as soon as they did is in no small measure due to the 
remarkable growth of life insurance in the United States. With increas¬ 
ing numbers of persons rejected for life insurance during the last de¬ 
cades of the nineteenth century, there arose in the course of time an 
effective demand for the insurance of substandard risks. American com¬ 
panies sou^t to make such insurance availsble and turned to th^ 
actuaries to devise methods and to determine premium rates for insur¬ 
ing substandard risks. In 1889 the Actuarial Society of America was 
organized and this body naturally brou^t to bear on these problems 
the techniques originated and the experience with mortality investiga¬ 
tions gained by actuaries in Great Britain. No less opportune was the 
introduction at about this time of punch card equipment for making 
mortalily investigations. As a result of these developments, and because 
of the accumulation of a sizeable volume of data by reason of the rapid 
growth of life insurance, the Actuarial Society of America was able by 
1901 to undertake an extensive intercompany investigation (Spedalized 
Mortality Investigation) of various factors affecting the mortality 
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insured lives. This investigation was the forerunner of many similar 
efforts, both intercompany and by individual companies, designed to 
provide mortality statistics for life insurance underwriting. 

MOBTALITT STATISTICS ADAPTED TO UNDERWRITINO CONSIDERATIONS 

Mortality statistics derived from the experience among insured lives 
furnish the most pertinent guide for life insurance underwriting. This 
is because they show the actual results obtained for the type of informa¬ 
tion normally elicited in life insurance applications and from other 
sources available to life insurance companies. 'While our general 
knowledge of the factors influencing mortality has been greatly en¬ 
riched by the increasing numbers of statistical studies made by 
research groups, hospitals, physicians, and others, such studies have 
sometimes only limited applicability to life insurance underwriting. 

Mortality fodors considered in underwriting. Most of the mortality 
investigations made by life insurance companies to obtain information 
for underwriting purposes have been concerned with those particular 
characteristics known to affect mortality about which information 
could readily be obtained by an insurance company. In addition to age, 
the principal characteristics so used in appraising applicants’ prospects 
of longevity have been: (1) occupation, (2) height and weight, (3) 
phyrical impairments brought to light by the usual type of medical 
examination for life insurance, (4) personal medical history and family 
history as disclosed in response to questions asked in life insurance ap¬ 
plications, and (5) habits as reported by independmt inspections. These 
are by no means the only characteristics hi^y correlated with mor¬ 
tality variations or even necessarily the most important ones, but over 
the years it has been found possible to secure satisfactory information 
about them throu^ the channels available to life insurance com¬ 
panies. Details of these characteristics are obtainable more accurately 
on applications for Ordinary insurance than on other forms of life in¬ 
surance and as a result studies of mortality made for imderwriting 
purposes have been predominantly based on the experience of lives 
insured under Ordinary policies. 

Numericcd rating system. Gradations in the factors commonly con¬ 
sidered in underwriting are expressed in terms of the excess mortality 
they are expected to bring about among lives otherwise acceptable for 
insurance at standard premium rates. This excess mortality for a group 
having a specific characteristic, stated as a percentage of the mortality 
among a similarly constituted group of lives acceptable for standard in¬ 
surance, is referred to as the rating for the characteristic; by adding the 
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ratings for various characteristics of a risk, treating interrelated char¬ 
acteristics as a single factor, a numerical appraisal of the total risk 
is obtained. This method of appraising longevity in numerical terms 
(known as the numerical rating system) requires that the ratings for 
the various characteristics be based on actual experience whenever 
possible. 

Mortality ratios. To facilitate interpretation of mortality statistics 
as ratings, mortality data for insured lives have customarily been sum¬ 
marized as ratios of the deaths actually experienced in a group to the 
deaths that would have occurred had the group been composed of other¬ 
wise comparable lives acceptable at standard premium rates. Such 
ratios show only the relatwe mortality associated with a particular char¬ 
acteristic during a given period of observation, but this is the criterion 
usually employed in classifying risks for underwriting purposes. It 
should be noted that in determining both the numerator and the de¬ 
nominator of these ratios, the number of policies terminated by death 
(or the amoxmts of claims paid) has generally been used in lieu of the 
number of persons involved. While the mortality ratios by number of 
policies (or amounts of insurance) may occasionally exhibit extraneous 
fluctuations due to a large number of policies (or a substantial amount 
of insurance) being involved in a single death, they have the advantage 
of reflecting more accurately the financial effect of the extra mortality 
traceable to particular characteristics. 

Risks appraised as members of a dass ratJier than as indioiduols. A 
very precise appraisal of various characteristics in numerical terms is 
neither practicable nor necessary for classifying applicants as life in¬ 
surance risks. Only on very large amoimts of insurance does it become 
worth while to make elaborate investigations. For most practical pur¬ 
poses, it is sufficient to be able to make decisions as to the broad class 
of risk to which a particular applicant should be assigned. The averag¬ 
ing of risks is a fundamental principle of insurance and the underwriter 
accordingly looks upon risks more nearly as members of a class than as 
individuals. In line with this principle, ratings are more appropriately 
regarded as mean values of a reasonable range of variation. In making 
mortality studies of various characteristics for the purpose of deter¬ 
mining ratings, actuaries have, therefore, often preferred to deal with 
risks in broad groups, whose heterogeneity corresponded to that of the 
rating classifications, rather than subdivide the experience into more 
homogeneous but smaller groupings subject to large chance fluctua¬ 
tions. 

Risks dassified m bads of characterisUcs at time of application. Life 



278 


AMEBICAN' STATISTICAL ASSOCIATION 


insurance policies do not as a rule contain any provisions which enable 
the company to change the classification of a risk after issue, even 
though the risk increased materially. Applicants for life insurance must, 
therefore, be appraised on the basis of their characteristics at time of 
application for the insurance, taking into consideration the possibility 
of the risk’s increasing. This has important implications as to the type 
of mortality statistics needed for life insurance underwriting. For char¬ 
acteristics that can be changed by the insured at his own volition, such 
as occupation or aviation activities, the ratings must be determined 
from an experience of closed groups of individuals having the particular 
characteristics at time of entry; such experience should reflect the effect 
of the usual proportion of individuals changing to a more hazardous sta¬ 
tus as well as the proportion of individuals who (diange to a less hazard¬ 
ous status and leave the group. For characteristics where the excess 
mortality does not manifest itself until many years have elapsed, such 
as overwei^t or moderate elevation in blood pressure, the ratings must 
be determined from experience of sufficiently long duration. 

Incidence of mortality by age and duraiim. Because premiums for 
life insurance depend to a marked degree on the incidence of mortality 
by age and duration from issue, mortality statistics for life insurance 
underwriting have usually been compiled so as to bring out the varia¬ 
tions in mortality by age and duration. Such information is not only 
helpful in appraising and classifying risks but indicates also the type 
of extra premium to be charged—constant with age, increasing with 
age, decreasing with duration, etc. Mortality statistics by number of 
policies and especially by amounts of insurance are, of course, more 
suitable than mortality statistics by lives for the purpose of determining 
extra premium rates. 

liETHODS USED IN UOBTALITT INVESTIGATIONS OF INSUBED LIVES 

IrUercompany mortality investigations. The principal investigations 
made to obtain mortality statistics for underwriting purposes have 
been carried out by the Joint Committee on Mortality of the Actuarial 
Society of America and the Association of Life Insurance Medical 
Directors. The first study made by this Committee (Medico-Actuarial 
Mortality Investigation) was based upon the combined e^erience of a 
ninnber of companies under Ordinary policies issued during the years 
1885-1908 and included avei&ge hm^ts and weights, the effects of 
build on mortality, the mortality of women and certain races, mortality 
by occupation and mortality for various medical impairments. Since 
that time, nine additional mortality investigations have been made by 
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the Joint Committee, covering chiefly medical impairments and occu¬ 
pations.^ 

These later investigations have all dealt vrith the combined experi¬ 
ence among lives insured under Ordinary policies in the larger life in¬ 
surance companies of the United States and Canada. The usual 
procedure has been for individual companies to send to the Joint Com¬ 
mittee summary sheets for each class of risk included in the study 
shovdng (1) the “exposed to risk,” (2) the “expected” deaths, and (3) 
the actual deaths, tabulated by broad age groups at issue and appropri¬ 
ate groupings by duration from issue. As the basis for calculating the 
“expected” deaths in each investigation, the Joint Committee has 
constructed special mortality tables (often referred to as “basic tables”) 
in “select” form, i.e., showing mortality rates by age groups at issue 
and duration from issue. These tables represent the companies’ com¬ 
bined experience under Ordinary life insurance issued at standard 
premium rates for the period of years covered by the investigation. Spe¬ 
cial tables have also been prepared to serve as yardsticks of the 
expected mortality from the major causes of death. 

The data for these mortality investigations have been assembled by 
the individual companies on punch cards, prepared in the larger com¬ 
panies for a sample of the Ordinary policies issued at standard premium 
rates and for all Ordinary policies issued at substandard premium 
rates. The punch cards have been coded for occupation, medical im¬ 
pairments, and certain other characteristics in accordance with codes 
specially prepared by the Joint Committee on Mortality. Occasionally, 
it has been necessary to go back to application papers in order to study 
characteristics which were not distingui^ed in the codes publiriied by 
the Joint Committee. 

From time to time, the Joint Committee has asked the companies 
for suggestions as to the classes of risks whose mortality it would be ad¬ 
visable to investigate. The classes favored have then been reviewed to 
determine whether the volume of data was large enough to warrant 
riieir inclusion in the study. For this purpose the larger companies have 
maintained up-to-date tabulations of the number of deaths on policies 
with rateable occupations, medical impairments, etc. 

As a general rule, in selecting cases with a particular characteristic 
for inclusion in a study, it has been customary to eliminate those that 


1 Blood Pressure Study 1925, Joint Occupation Study 1928, Medical Impairment Study 1929; 
Supplement to Medical Impaixment Study 1932, Impairment Study 1936; Ocoupationsl Study 1937; 
Impairment Study 1938; Blood Pressure Study 1939; Supplement to Blood Pressure Study 1939. 



280 


AMEElCAN STATISTICAL ASSOCIATION 


also presented other characteristics which by themselves would have 
precluded the issuance of a policy at standard rates. On the other 
hand, cases accepted at standard rates have been included even 
though they might exhibit minor characteristics not of sufficient im¬ 
portance to require an extra premium but nevertheless of possible sig¬ 
nificance. Under these circumstances, the experience of a class may to 
a degree reflect the excess mortality attributable to the cases with 
minor characteristics. The proportion of cases with such minor charac¬ 
teristics has usually been a factor in determining w^hether a particular 
class should be studied, and in some investigations the cases with 
minor characteristics have been eliminated to obtain an experience 
more truly representative of the characteristic under study. 

Shortcomings of intercompany studies. There has been some question 
as to whether mortality investigations of the combmed experience of 
many companies with different underwriting practices and standards 
yield significant results for certain classes of risks. The point has been 
made that the mortality experience of companies having similar under¬ 
writing practices and standards could be interpreted more accurately 
and that the best guide for an individual company w^as its own experi¬ 
ence. Because of different application forms, different medical exami¬ 
nation requirements, and different attitudes toward indefinite reports, 
it has been found that the cases included by different companies in 
classifications which are not susceptible to a precise definition may 
differ markedly. However, the data included m classifications that can 
be clearly defined have been found to give a satisfactory degree of 
homogeneity. Present opinion favors a continuation of mortality inves¬ 
tigations of the combined experience of many companies for classes 
where the data are homogeneous or where a sufficiently large experi¬ 
ence could not be obtained otherwise. 

Individual company and other mortality studies of insured lives. Studies 
of the experience of individual companies have also contributed much 
information regarding effects of various characteristics on mortality.® 
The results have generally been easier to interpret than those of inter¬ 
company investigations because of greater homogeneity of the data and 
the greater ease with which questionable points could be reviewed. On 
the other hand, small numbers have been more frequently encountered 
in individual company studies, so that recourse has sometimes been 


> Among the more notable of these ha-ve been a series of papers by A. Hunter and O. Bogets {Trans- 
actions of the Actuarial Society, Vols. XXFTI to XXX) and another series by L. I. Dublin in collabora¬ 
tion mth various others (Proce^ings of the Association of Life Insurance Medical Directors, Vols. XV, 
XES to XXVI, and XXYIII). 
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had to more advanced statistical techniques to gauge the reliability of 
the results. While individual companies’ mortality investigations have 
as a rule been made from the same punch cards as those used in inter¬ 
company investigations, the methods employed have, of course, varied 
considerably. For instance, some companies have occaaonally deter¬ 
mined their experience for various classes of risks by relating all the 
deaths in a class to closures built up from a sample of the policies 
issued.* 

ITie techniques used in follow-up studies of individuals subjected to 
special examinations and of rejected risks also deserve attention be¬ 
cause of their applicability to other fldds. The usual practice has been 
to trace individuals dbiefly by reference to their insurance holdings in 
various companies. It now appears possible that Social Security records 
might be utilized for this purpose, if suitable arrangem^ts can be 
worked out with the Social Security Administration, and the records 
are handled in such a manner that particular individuals cannot be 
identified. 

TINDINGS OT THE PBINCIFAIi MORTAUTT STUniES OF INSURED LIVES 

In looking at the findings of the principal mortality studies of insured 
lives, it is necessary to keep in mind that they represent the experience 
among persons insured imder Ordinary policies. Such persons comprise 
a highly selected portion of the general population; they are pre¬ 
dominantly white male adults, and in the past at least were drawn 
chiefly from the more prosperous classes of the population. Even when 
insured at substandard premium rates, they usually do not present 
characteristics that result in a drastic reduction in longevity. 

While absolute death rates for various classes of risks can be derived 
from the mortality ratios developed in these investigations, the mor¬ 
tality ratios themselves indicate only the relative mortality of different 
classes of risks, compared with the level of mortality among lives in¬ 
sured at standard premium rates during the same period of years. 
Since the level of mortality among lives insured at standard rates has 
been changing over the years, different yardsticks of mortality have in 
effect been used in different studies. As the level of standard mortality 
has declined, the relative mortality of risks whose absolute mortality 
rates remaiaed unchanged has increased, while the relative mortality 


> See paper ‘Sampling Errors in Life Insurance Mortality and Other Statistics” by Donald Cody 
to be published in the September 1948 issue of the Journal c/ the Amenean StattOnctd Aeaocialwn. 
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of risks whose absolute mortality rates decreased more rapidly than 
that of standard lives has declined. 

Most of the mortality investigations of insured lives represent ex¬ 
periences spanning relatively short periods of time. The great volume of 
the experience has usually been in the years immediately following 
issue of the insurance, while the experience at the longer durations has 
sometimes been too scant to give reliable information. This is a serious 
shortcoming in the case of those classes of risks where the extra mor¬ 
tality is deferred, since the results will tend to understate the excess 
mortality that may actually be experienced over the entire period of 
insurance. 

Despite these and other limitations, mortality studies of insured lives 
provide much pertinent information of a broad qualitative nature as 
to the effects on mortality of occupations, height and weight, physical 
impairments, and medical histories. In interpreting the experience of 
particular groups of lives, it is essential to have a clear idea as to the 
type of cases included in each classification. These points are illustrated 
in the following discussion of the findings of the more important studies. 

OecupcUions. The first extensive intercompany investigation (Special¬ 
ized Mortality Investigation) covered the eiqierience during the years 
1870-1900. It included a study of 35 groups involving an occupational 
hazard, among th^ commissioned officers, the more important hazard¬ 
ous trades, liquor distribution, and railroad service. 

The Medico-Actuarial Mortality Investigation, which covered the 
years 1885-1909, included 168 groups of occupations. The most im¬ 
portant of these were m the liquor industry, railroad service, metal 
trades, and mining. This study showed that mortality was particularly 
hi^ among undei^ound miners, railroad engineers and firemen, struc¬ 
tural iron workers, stone cutters, and many occupations in the liquor 
industry. In certain hazardous occupations, the unusually low mor¬ 
tality experienced was believed to be due to very careful sdiection of 
risks by the companies and to the short average duration of the ex¬ 
perience. 

The most comprehensive investigation of occupations thus far con¬ 
ducted was the Joint Occupation Study. It called for the study of some 
430 groups of occupations and covered the years 1920-27 for occupa¬ 
tions with a major accident hazard and the period 1915-27 for other 
occupations. The occupations studied ran practically the entire gamut 
of occupational hazards commonly met with among applicants for 
Ordinary life insurance. Very hi^ mortality was found among metal 
miners, coal miners, oil and gas field rig builders, linemen in electric 
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li^t and power plants, cMppers of metal, laborers in iron and steel 
works and foimdries, and bartenders. Tuberculosis was found to be 
high in some 25 occupational classes, chiefly among unskilled labor and 
in occupations characteristic of the lower income groups, and also 
among those exposed to the silica dust hazard, notably stone cutters. 
A special study was also made of the accidental death rates, differen¬ 
tiating between those due to occupational and non-occupational acci¬ 
dents. The hipest accidental death rates were found amoi^ electric 
li^t and power linemen, oil and gas field rig builders, underground 
miners, structural iron workers and freight train brakemen. 

The most recent of the investigations dealing with occupational 
hazards—^the Occupational Study of 1937—covered the years 1925-36 
and included 87 occupational groups. Among them were many for 
which the results in the Joint Occupation Study had been inconclusive. 
Very high mortality was found among metal miners (high accidental 
death rate, tuberculosis and pneumonia), motorcycle police (high ac¬ 
cidental death rate), anthracite coal miners (high accidental death rate, 
tuberculosis and pneumonia), shippers of metal (high tuberculosis), 
electric light and power linemen (hi^ accidental death rate), and stone 
cutters (high tuberculosis). The study showed a decrease in the mor¬ 
tality of laborers in non-hazardous industries, believed to reflect in 
some measure the improvement in the living conditions of the lower 
income groups. 

A review of the findings of these investigations indicates that the 
mort^ty in occupations with pronoimced accidental death or health 
hazards has in general increased relative to the mortality of lives in¬ 
sured under Ordinary policies at standard premium rates; on the other 
hand, the relative mortality in many occupations where the excess 
mortality was at one time largely due to low living standards appears 
to have decreased. This supports the agnificant conclusion reached 
from various studies that the effects of occupation on mortality are, on 
the whole, more indirect than direct; mortality in most occupations is 
influenced more by the environment and the living standards associ¬ 
ated with the occupation than by the accident and health hazards 
traceable to particular occupational duties. 

Height and Weight. The first table of average wdghts according to 
height and age, based on a considerable volume of statistics, was pre¬ 
pared from life insurance data by the Association of life Insurance 
Medical Directors in 1897 and was known as the Shepherd table. 

The Medico-Actuarial Mortality Invest^ation developed new tables 
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of average heights and wei^ts by age for men and ■women and also in¬ 
cluded an extensive investigation of the effects of build on mortality. 
The height and 'weight tables ■were based on the records for 221,819 
males and for 136,504 females, representing a random sample of the 
heights and weights, in street clothing (without overcoats), of persons 
to whom standard Ordinary policies were issued mainly during the 
years 1885 to 1900, inclusive. These tables of average hdghts and 
weights remain in general use today, because subsequent studies diowed 
that there has been little change in average heists and weights over 
the years. The investigation of the effects of build on mortality brought 
out the following main conclusions: 

(1) That mortality increases ■with decreasing weight at the younger 
ages, 

(2) That the effect of underwei^t on mortality decreases ■with ad- 
vancii^ age, 

(3) That overweight is not a serious impairment at the younger ages 
but its adverse effect on mortality increases ■with advancing age, 

(4) That the lowest mortality at ages under 35 is found among those 
15 to 20 pounds above average but at ages 35-49 it occtirs among 
those 5 to 10 pounds underweight, 

(5) That young underweights have hi^ mortality from tuberculosis 
and pneumonia, 

(6) That overweights have high death rates from diabetes, nephritis, 
heart disease, and cerebral hemorrhage. 

Average heights and weights were determiaed again in the Medical 
Impairment Study for persons to whom standard Ordinary policies 
were issued during the years 1909-27, but no new tables were published 
for ages 25 and over because the figures developed were found to be 
remarkably close to those established by the Medico-Actuarial Mor¬ 
tality Investigation. New tables of average heights and wei^ts were, 
however, compiled for ages 10 to 24. The Medical Impairment Study 
also confirmed the major findings of the Medico-Actuarial Mortality 
Investigalion in regard to the effect of build on mortality. 

It appears likely that the rdative mortality of both overwei^ts and 
underweights has changed somewhat since the publication of the 1929 
Medical Impairment Study. This study did not extend over the recent 
period of years during wMdi mortality from tuberculosis and pneu¬ 
monia showed a marked decline. Since these causes of death have in the 
past accoimted for practically all of the excess mortiJity among under* 
wei^ts, it would be reasonable to suppose that the mortality of under- 
wm^ts has improved in relation to that of lives insured under Ordinary 
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policies at standard premium rates. On the other hand, there has ap¬ 
parently been little change in the mortality from heart disease, diabetes, 
and other degenerative diseases, which have accounted for most of the 
excess mortality among overweights. This suggests that the relative 
mortality of overweights may have increased. 

Physical Impairments and Medical Histones. The particulars given in 
life insurance applications concerning occupation and height and 
weight can as a rule be taken without qualification. However, this is 
not the case in much of the information elicited regarding physical im¬ 
pairments and medical histories. 

The medical examination for life insurance consists usually of two 
parts. The first is made up of the applicant's own statements regarding 
details of his physical condition, history of illnesses, injuries, and medi¬ 
cal treatment received, as well as some miscellaneous matters such as 
family history. The second part is in the nature of a report by the 
medical examiner on the physical examination of the applicant; this 
examination customarily embraces a check on the important organs of 
the body, height and weight, a blood pressure reading and urinalysis. 
Special tests such as an electrocardiogram. X-rays, and blood sugar 
tests are expensive and can be used only on large amounts of insurance 
or in other cases where they are clearly of great value. There are also 
practical limits on the extent to which indefinite statements made by 
the applicant or the medical examiner can be followed up. The result 
is that many of the descriptions of physical impairments and medical 
histories on life insurance applications represent relatively undifferen¬ 
tiated conditions. The findings of the physical examination itself cannot 
usually be compared in point of comprehensiveness and precision -with 
those made by clinicians who are in position to study a patient for a 
longer period of time, more intensively and with his full cooperation. 
As a consequence, the mortality experienced among insured lives with 
certain physical impairments or medical histories furnishes a guide of 
only limited relevance for cases where specific and accurate diagnoses 
have been established. 

Because persons suffering from serious organic diseases or from acute 
illnesses would not ordinarily apply for or be accepted for life insurance, 
the mortality studies of insured lives have perforce dealt with the less 
severe and non-acute conditions. With few exceptions, clinical medicine 
has not been able to follow large numbers of patients with relatively 
minor impairments for a sufficiently long period of time to determine 
their effect on mortality. Despite their limitations, mortality studies 
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of insured lives constitute, therefore, a major source of information 
regarding the significance of many such impairments. The most valu¬ 
able contributions to medical science made by mortality studies of 
insured lives have probably been with respect to factual conditions 
that can be appraised in numerical terms, such as build, blood pressure, 
urinalysis findings and pulse rate. 

The earliest of the intercompany studies—^the Specialized Mortality 
Investigation—^included 24 groups of physical impairments and medi¬ 
cal histories. However, only cases accepted at standard premium rates 
were included and some of the classes were described in terms no longer 
current. The Medico-Actuarial Mortality Investigation included 83 
groups of physical impairments and medical histories, all but two of 
which were lunited to eases accepted at standard premium rates. The 
list of impairments studied was not greatly different from that of con¬ 
ditions considered significant today. Among the many results of in¬ 
terest, it was found t^t persons with a rapid pulse rate of between 90 
and 100 per minute showed substantially hi^er than normal mor¬ 
tality, while those with a pulse rate over 100 per minute experienced 
even hi^er mortality. Later studies corroborated these findings. 

The most comprehensive of the intercompany investigations of phys¬ 
ical impaiiments and medical histories was the Medical Impairment 
Study. It covered the experience on policies issued during the years 
1909-27 and included 122 groups of impairments and histories, many 
being further subclassified by duration and severity. Cases accepted at 
extra premium rates as well as those accepted at standard premium 
rates were studied. Because of the inclusion of cases accepted at extra 
premium rates, the mortality associated with a number of more serious 
conditions was developed for the first time in an intercompany study. 

Probably most important among the serious conditions were several 
classes of heart impairments, diagnosed primarily by reference to t 3 rpes 
of heart murmurs they produce. The mortality experienced on certain 
of the heart murmurs confirmed clinical opinion to ihe effect that they 
were relatively unimportant, but other t 3 T>es of murmurs were found 
to be associated with very high mortality. A heart murmur in conjunc¬ 
tion with a moderate degree of hypertrophy of the heart and particu¬ 
larly with a history of rheumatic fever was shown to be a very serious 
condition. Heavy manual work was found to have an adverse effect on 
the mortality of those with heart disease. 

The Medical Impairment Study included also the e^erience for 
other combinations of impairments such as heart murmurs with ele¬ 
vated blood pressure, albumin with casts, and albumin with devated 
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blood pressure. It was found that the combination of albumin and over¬ 
weight presented a very much more serious impairment than the sum 
of the extra mortality associated with each of these conditions would 
indicate. 

The Medical Impairment Study brought out clearly for the first time 
that even small elevations in systolic blood pressure are associated with 
a marked increase in mortality. This finding was at variance with gen¬ 
eral medical opinion and a number of additional investigations were 
made to learn more about the effects on mortality of variations in blood 
pressure. The most important of these was the 1939 Blood Pressure 
Study, which considered the variations in both S 3 rstolic and diastolic 
pressures. The main conclusions of ibe several studies of blood pressure 
were as follows; 

(1) A ssrstolic blood pressure of 20 mm. above average or a diastolic 
blood pressure of 15 mm. above average results in distinctly 
higher mortality than normal. 

(2) Mortality is heavier where both systolic and diastolic pressures 
are high. 

(3) The combination of elevated blood pressure with certain minor 
impairments, particularly those of the cardiovascular-renal sys¬ 
tem, produces appreciably higher mortality than the sum of the 
extra mortality associated with the respective conditions. 

(4) The lowest mortality is found in the group with S 3 r 8 tolic pressures 
from 5 to 15 mm. below average. 

Two additional mortality studies supplemental to the Medical Im¬ 
pairment Study were made in 1936 and 1938. They covered 42 classes 
of ph 3 rsical impairments and medical histories. Further evidence was 
adduced by these studies as to the adverse effect on mortality of gastric 
and duodenal ulcers, removal and drainage of gall bladder, toxic goitre, 
syphilis, renal colic, irregular and intermittent pulse and albuminuria. 
The findings for several of these classes have probably lost some of thrir 
rignificance because of the recent advances in medical treatment and in 
surgery; the relative mortality of conditions that may require surgery, 
such as gastric and duodenal ulcers, is likely to have declined somewhat 
and the same probably applies to the mortality associated with S 3 q>hi]is. 

STATISTICS BSOM SOUBCSS OTHKB THAN THE 
EXFEBIENCE OF INSUBED LIVES 

It usually takes a long time to accumulate suffident data for a mor¬ 
tality investigation of insured lives with relatively rare characteristics. 
For many types of risks it is virtually impossible to obtain mortality 
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statistics based on the experience of insured lives. In such cases the 
underwriter naturally falls back on whatever other information is avail¬ 
able. Another difficulty is that by the time the mortality statistics have 
been compiled, the relative hazard of a characteristic may have 
changed materially. This has occurred in the case of certain occupations 
where the working and living conditions have improved markedly in 
recent times, probably also to some extent for overweights and under¬ 
weights because of the changing mortality trends, and for a few medical 
conditions where new and radical methods of treatment have been dis¬ 
covered. Since the underwriter is primarily concerned with the mor¬ 
tality likely to be experienced on various types of risks in the future, he 
must keep in close touch with changing conditions in industry, with 
developments in medicine and public health, and with mortality trends 
in general. The availability of more and better public health statistics, 
medical statistics, and general population statistics has been of great 
help to actuaries and underwriters in making judgments regarding the 
probable future mortality of different classes of risks. 

Life insurance companies have made considerable use of the records 
available in governmental agencies and in industry to estimate death 
rates for many occupations. The statistics maintained by governmental 
agencies have made it possible to determine the mortality rates among 
personnel of the armed forces, scheduled airline pilots and state police, 
to mention only a few. It has been more difficult to derive death rates 
for various occupations in industry, because records for individual oc¬ 
cupations are rarely segregated and because often only the deaths due 
to occupational accidents can be traced. Even this limited information, 
however, has been of value in considering occupations where accidents 
are the major hazard. It might be noted that appraisals of occupational 
hazards based on government or industry statistics represent usually 
the death rates among those actively at work. For underwriting pur¬ 
poses, it may be necessary to make allowances for the higher mortality 
among those in the occupation who have become disabled and left work 
and for the proportion of the individuals who change to more hazardous 
occupations in the normal course of events. 

Statistics compiled by hospitals, physicians and public health agen¬ 
cies have also been widely consulted for life insurance underwriting. 
Clinical statistics have been helpful in detenmnmg appropriate ratings 
for the many relatively rare conditions on wrhich life insurance experi¬ 
ence can shed no light. Such data have also proved of value in indicat¬ 
ing under what circumstances a relatively serious unpainnent, such as 
diabetes, pre'\iously considered uninsurable, may be accepted at extra 
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premium rates. Although public health and population statistics have 
served chiefly to bring out changes in mortality trends, they too have 
occasionally been used as a basis for ratings. The application of clinical 
data to medical underwriting has presented many difficulties: for in¬ 
stance, the problem of translating the findings of accurate clinical 
diagnoses into the less definite categories found on life applications, and 
the difficulty of allowing for the difference between the details normally 
disclosed by a patient to his physician and the details stated on an ap¬ 
plication for hfe insurance, which may refer to conditions that have 
never received medical attention. It has also been necessary to make 
allowance for the relatively short term outlook in clinical medicine in 
contrast to the long term view of life insurance underwriting. The in¬ 
creasing use of follow-up studies by research groups, hospitals, and 
physicians indicates a growing appreciation of the need to study the 
long range effects on mortality of various medical conditions. 
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STATISTICAL PROBLEIMS IN DEVELOPING VOLUNTARY 
MEDICAL EXPENSE INSURANCE PLANS* 

Morton D. Miller 

The Equitable Life Assurance Society of the United States 

Of the 45,000,000 persons having hospitalization insurance, 
20,000,000 have become covered by some form of insurance 
for the expenses of medical care. Most of this coverage has 
been provided since 1938, principally by the life insurance 
companies through group insurance. This paper discusses the 
statistical problems accompan 3 ring the development of medi¬ 
cal expense insurance plans: the sources of data, the difficulties 
encountered in adapting the data to the purpose, and the na¬ 
ture and purpose of the statistics to be collected as experience 
unfolds. Insurance company experience with medical expense 
insurance is producing much valuable information regarding 
the demand for various medical services, the cost of such serv¬ 
ices and the administrative problems involved in providing for 
medical services on a prepayment basis. 

T he tremendous growth of hospitalization insurance is a fairly 
familiar story. In the relatively short period since 1930, the number 
of persons insured has risen from almost nothing to the point where 
today more than 45,000,000 persons are covered for the cost of hos¬ 
pitalization. The development and spread of voluntary plans to insure 
the cost of doctors' services promises to be equally phenomenal. 

It is only since 1938 that the established insurance companies have 
been insuring surgeons' charges for the performance of surgical opera¬ 
tions and obstetrical procedures, and since 1943 that the companies 
have been covering the cost of other types of medical services. Now in 
1947, over 20,000,000 persons are insured for surgical benefits, of whom 
more than i are covered in addition by some form of medical expense 
insurance for non-surgical services. 

While community organizations similar to and sometimes aflSliated 
with the Blue Cross hospital service associations have participated in 
the development of medical expense insurance, the established insur¬ 
ance companies have taken the lead in promoting these new forms of 
protection. The companies insure the majority of the persons covered, 
principally through group insurance written by the life insurance com¬ 
panies. Group insurance, as distinguished from individual insurance, 
is provided by means of a single master contract issued usually to an 
employer to insure the group of his employees and frequently their 
wives and children. 

* A paper presented at the 107th Annual Meeting of the American Statistical Association, New 
York City, December 30,1947. 
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Group surgical insurance provides cash reimbursement for the 
amount of the surgical or obstetrical fee, subject to a maximum 
amount of reimbursement which is dependent on the nature of the 
operation. The maximum amounts of reimbursement with respect to 
various types of operations are set forth in a schedule that appears in 
the master group policy and in the certificates of coverage furnished to 
the individual employees. In those cases where the surgeon’s charge is 
in excess of the maximum amount of reimbursement for the operation, 
the insured employee receives the maximum reimbursement from the 
insurance company and pays the excess from his own funds. Coverage 
of obstetrical procedures is optional but is usually included. 

A typical schedule of maximum reimbursement will be found to pro¬ 
vide a maximum reimbursement of 5150 for the most serious opera¬ 
tions, such as cutting into the cranial cavity, maximum reimbursements 
of $100 for an appendectomy, $50 for an operation for a single hernia, 
$25 for a tonsillectomy, and $50 as the obstetrical benefit for a natural 
delivery. 

Although group surgical insurance is a form of group medical expense 
insurance, the term “group medical expense insurance” has come to be 
used to describe the plans of group insurance covering the costs of 
medical services for other than surgery and obstetrics. A wide variety 
of such plans with differing benefits is available. Reimbmrsement may 
be provided fof doctors’ visits, ustially in the amount of $2 for visits 
by the patient to the doctor’s office or by the doctor at a hospital, and 
in the amoimt of $3 for visits by the doctor to the patient’s home. 
Coverage may be comprehensive and include reimbursement for the 
expense of all visits; sometimes the first one or two visits in any illness 
are excluded. More limited coverage plans are also being written. Under 
total disability plans, reimbursement is made only for the doctor’s 
visits needed in illnesses or injuries which are disabling; under in-hos- 
pital plans, only those visits in medical cases requiring hospitalization 
are reimbursed. The coverages for doctors’ visits may be supplemented 
by additional coverage for the cost of diagnostic x-ray and laboratory 
examinations, in accordance with a schedule of maximum reimburse¬ 
ment for such examinations analogous to the schedule of maximum 
reimbursement for surgical operations. 

These new forms of insurance involve numerous insurance and sta¬ 
tistical problems. This is equally true whether the coverage is being 
written by an established insurance company or by a community plan, 
similar to the Blue Cross, although sometimes the sponsors of such 
plans do not appreciate this fact as fully as they might. 
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The first problem is that of securing some basis for the establishment 
of the initial premium rates. The basic factors required for the calcula¬ 
tion of premium rates are: first, a rate or frequency of occurrence of 
the event insured against, for example, the annual rate of surgical 
procedures per himdred insured; and, secondly, the average amount 
of the benefit payable when the event occurs. 

While there were no insurance data directly applicable to the calcu¬ 
lation of premium rates for group surgical insurance, experience de¬ 
veloped in connection vdth hospitalization and other forms of insurance 
did furnish collateral information that was helpful. One company com¬ 
puted a scale of premium rates for group surgical insurance as follows. 

At first, surgical coverage was limited to those cases where the sur¬ 
gery was performed in a hospital. This made it possible to base the an¬ 
nual frequency of surgical procedures on the annual rate of hospitaliza¬ 
tion, which experience showed to be 7 per hundred for male employees. 
It was observed that J of aU hospitalized cases involved surgery and 
accordingly the annual frequency of surgical procedures for male em¬ 
ployees became 5.3 per hundred. Then, after study of the proposed 
schedule of benefits, it was estimated that S50 would be the average 
amount of each claim. The product of the frequency and the average 
benefit gave $2.65 for the net annual claim cost for a male employee. 
This figure, when increased by the amount calculated to be necessary 
for expenses of operation and contingencies, resulted in a gross monthly 
premium rate of $.35 per male employee. 

The gross monthly premium rate for a female employee was set at 
$.70, double that for a male employee. This was consistent with the 
experience under other forms of disability insurance; including both 
weekly indemnity benefits and hospitalization insurance, where the 
claim cost of employed females was observed to be twice that of males. 
In practice, separate premium rates were not charged for male and 
female employees but a composite rate was computed at the effective 
date of the group policy depending on the percentage of female em¬ 
ployees covered. If less than 11 per cent of the coverage was on fe¬ 
male employees, the premium rate was not increased above the male 
rate of $.35. If more than 11 per cent and less than 21 per cent of the 
coverage was on female employees, the basic male rate was increased 
by 15 per cent. If the percentage of coverage on females was more than 
21 per cent and less than 31 per cent, the basic male rate was increased 
by 25 per cent and so on. 

Insurance was also to be offered for the wives and children of em¬ 
ployees, and so a separate premium rate had to be calculated for 
family coverage. A gross monthly cost of $.70 was arrived at for non- 
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maternity surgery for a dependent wife and a cost of $.60 for obstetri¬ 
cal benefits. For a child, the gross monthly cost was estimated to be 
th< same as for a male employee, namely $.35. It was decided to charge 
the same premium for each family covered regardless of how many 
children the family included; and for this purpose it was assumed that, 
cn the average, a dependent family unit would be composed of a wife 
*jjid two children. Thus, the gross monthly premium rate for dependents 
became $1.40 per family imit where obstetrical benefits were not to be 
included and $2.00 per family unit where obstetrics were to be covered. 

The initial basic monthly premium rate for employee coverage was 
subsequently increased from $.35 to $.40 when the benefits were ex¬ 
tended to cover out-of-hospital as well as in-hospital surgery and has 
since remained imchanged. The rates for family coverage were progres¬ 
sively reduced as experience developed and it was seen that they were 
redundant. The present gross monthly rates for the same company 
are $1.20 per family imit without obstetrical benefits and $1.55 includ¬ 
ing obstetrical benefits. 

The construction of initial premium scales for the group medical 
expense insurance plans was similar to the calculation described in con¬ 
nection with surgical benefits. However, the statistical basis was even 
less satisfactory. It was necessary to fall back to a greater extent on 
studies made of the utilization of medical care by uninsured individuals. 
What there were of such studies were usually quite limited in their use¬ 
fulness. Generally, the number of persons under observation was small 
the locale of the group being investigated was restricted and the amount 
of pertinent information secured was meager. Furthermore, since these 
studies were undertaken mostly from the point of view of the economic 
theorist or sociologist, they are not very easily adapted to the require¬ 
ments of the insurance actuary. 

A notable exception was a study conducted during the period from 
1928 to 1931 on a sample of 9,000 families, chosen geographically and in 
income groups so as to be representative of the white population of the 
nation. Each family was interviewed at intervals over a twelve months 
period in order to secure detailed information regarding the incidence 
of illness and the extent and kind of medical care sought in connection 
with it. The statistical data were analyzed and published in the United 
States Public Health Reports and also were used extensively in the 
series of monographs published by the Committee on the Costs of 
Medical Care. 

Much caution has to be exercised in using investigations based on 
uninsured lives for insurance purposes. The methods of collecting and 
classifying the data must be reviewed, because, for example, an 
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enumeration process such as employed in the study of 9,000 families 
tends to underestimate the true figures. The period covered by such a 
study has also to be considered in the light of the rapidity with which 
hospital and medical practices are changing. 

Of more importance in viewing studies among uninsured persons is 
the fact that the very presence of insurance benefits materially influ¬ 
ences utilization. A ready example is seen in the growth of hospitaliza¬ 
tion insurance as one of the factors contributing to the overcrowding 
of hospitals. The use of hospital and medical facilities is, to a large ex¬ 
tent, under the control of the individual and his physician. It is true 
that the degree of control is less for hospital confinement and surgical 
operations than for home and oflBice calls or diagnostic x-rays. Never¬ 
theless, an increase in utilization rates is to be anticipated under any 
hospital or medical expense insurance plan; this is a usual consequence 
of the reduction of the cost which would otherwise be a deterrent to the 
demand for services and their use. The greater the degree of control on 
the part of the insured and the doctor, the greater this effect may be 
expected to be. Moreover, experience demonstrates that each group 
insurance coverage tends to produce its own characteristic rates of 
utilization and related statistical factors. For one thing, the claim ex¬ 
perience is affected by the practices of the employers and the insurance 
carrier in administering the insurance contracts. 

After a plan of insurance of the type being discussed has been de¬ 
veloped and has begun to be sold, the next questions are: how is the 
experience to be watched and what statistics are to be compiled? 
Since these plans do not involve the accumulation of funds to offset 
the future cost of deferred benefits, as in the case of life insurance or 
annuities, a simple and very effective index of experience may be 
obtained by a comparison of premiums which become due with the 
claims that are incurred. Such a comparison brings out very readily the 
financial effect of the plan on the company's operations and is easy to 
compute from month to month and to maintain on a current basis. 
HoTvever, such ratios do little to satisfy the need for more detailed sta¬ 
tistics for rate making and other similar purposes. 

Group insurance policies are written for a period of one year and are 
renewable for annual periods thereafter. The premium rates at issue are 
guaranteed for the first policy year but are subject to re-rating upward 
or downward by the insurance company at the end of that period and 
at the end of each succeeding policy year. The policies participate in 
the surplus earnings of the company annually through the declaration 
of dividends or the allowance of retroactive rate credits, where earned. 
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Involved in the computation of dividends or retroactive rate credits is 
the process known as experience-rating, tv hereby weight is given both 
to the claim experience of the individual group policy and to the claim, 
experience of the class of policies to which it belongs. It is clear that in 
order to make possible renewal underwriting and the computation of 
dividends or retroactive rate credits, a record of premiums due and the 
number and amount of claims incurred must be kept separately for 
each group policy for each policy year. 

It is natural then that these individual group experience records 
should be the basis of further statistical analysis within a company. 
Each group policy can be classified in a number of ways: separately 
as to the coverage of employees and family units; according to the 
basic type of coverage it provides; according to the variations in specific 
benefit provisions, such as whether the maximum period of coverage 
for hospital expenses is 31 days or 70 days in any one confinement; 
according to the percentage of female employees covered; according to 
the industry of the employer, and by geographical location. Thus, a 
compilation of data regarding claims and premiums by policy years 
makes possible an extensive statistical analysis. Annual dollar claim 
costs per unit of benefit can be produced, as well as annual claim fre¬ 
quencies per insured person or family, and the average amounts of 
benefits paid per claim. These can be obtained separately for male 
employees, for female employees and for family units. In addition, the 
variation in these factors by industry, policy duration, calendar year 
and geographical location may be studied. 

This further analysis stiU does not give all the statistical information 
necessary for various purposes because it does not involve the classifi¬ 
cation of individual claims other than by the type of coverage and the 
group policy under which they vrere incurred. No information is fur¬ 
nished concerning the variation in claim frequencies, for example, by 
age, by cause of disability or type of surgical operation, or by duration 
of disability or period of hospitalization. Only the appropriate classi¬ 
fication of individual claims upon their completion and the analysis of 
their relationship to the numbers exposed to risk can produce such in¬ 
formation accurately. 

Here arises a question of approach about which there may be some 
difference of opinion among the insuring companies. The question is 
whether there should be set up, on a continuing basis for statistical 
purposes, a system of coding individual claims, involving the use of all 
or a specific fraction of all claims. The alternative is, of course, to de¬ 
velop the basic claim rates and claim costs on a continuing basis but to 
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rely on occasional sample studies designed for specific purposes for any 
additional detailed information which might be desired. 

The merits of the question revolve in good part around the cost of 
the two approaches and can be brought into focus by mentioning that 
currently one company is paying annually about 75,000 hospital ex¬ 
pense claims, 65,000 surgical claims and 45,000 medical expense claims. 
In the writer's opinion, the additional detailed information does not 
warrant the cost of securing it on a continuing basis as against the lesser 
cost of conducting periodic or occasional sample studies. Of course, 
some sample studies would have to be made anyway because the wide 
variety of information desired from time to time would not all justify 
inclusion in any continuous system. 

So far, the discussion has been in terms of the operations of a single 
company. Whenever possible, it is desirable to improve the reliability 
of statistical results by increasing the body of data available for study. 
Thus, the combined experience of a number of companies is not only 
more reliable than that of a single company but it also furnishes a stand¬ 
ard by means of which any one company may judge the eflSlciency of its 
operations. The medium for the compilation and analysis of group 
insurance experience on an intercompany basis is a standing com¬ 
mittee of the Actuarial Society of America known as the Committee 
to Prepare Mortality and Morbidity Studies on Group Insurance. 

The work of this Committee was interrupted by the war, as were 
many other statistical efforts of the individual companies. Accordingly, 
it is only now, after discussions held during the past 18 months, that 
the Committee has completed plans for the compilation of intercom¬ 
pany group morbidity experience. The contributing companies will be 
furnishing the Committee with aggregate “claims” and “exposed to 
risk” for each group policy appropriately classified so as to enable the 
derivation of unit claim costs. It is expected that the Committee's first 
report on group morbidity experience will be published in the 1948 
Tramdctions of the Actuarial Society. For the time being, group 
medical expense insurance is not being studied, largely because of the 
newness of the coverage, the wide variety of plans being written, and 
the diversity in policy provisions. It is hoped that it will be possible to 
commence studying group medical expense insurance on an inter¬ 
company basis at a later date. 

The Committee has also undertaken a sample study based on indi¬ 
vidual surgical claims. The study is designed to supply information 
regarding, first, the relative frequency with which different surgical 
procedures occur and, secondly, the relative values of the fees being 
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charged by doctors for performing such procedures. There is little 
known about either of these matters. A detailed frequency distribution 
of various types of operations should be on hand for the computation 
of premiums for a given surgical schedule in its entirety, as well as to 
enable the computation of the cost of a variation in benefit that might 
be under consideration for a particular surgical operation. 

With respect to doctors’ actual charges, the companies are interested 
not so much in their dollar value since it is not contemplated that the 
surgeon’s fee will necessarily be covered in full by the insurance. The 
companies are concerned instead, mth the relationships which doctors’ 
charges for different operations bear to one another. For, if surgical 
insurance is to be most effective, it is important that the maximum 
amount of reimbursement provided for each type of operation be 
reasonably proportionate to the surgeon’s fee. Thus, it is not desirable 
to have a surgical schedule which might on the average insure the full 
cost of tonsillectomies, for example, while covering only a small fraction 
of the average surgeon’s charge for performing an appendectomy. A 
scale of the relative values of actual charges will make it possible to 
determme whether existing surgical schedules are in accord with the 
pattern of the fees bemg charged by physicians, and will permit the re¬ 
vision of the schedules where they may not be. 

In planning the intercompany study, one of the problems encoim- 
tered was that of devising a suitable classification code for the various 
surgical procedures. A code was desired that would be neither too de¬ 
tailed to be practical nor insufficiently detailed for adequate analysis. 
The information was to be coded from claim papers vrhich do not 
always give complete medical descriptions. Furthermore, the bulk of 
the coding work would be done by clerks whose medical knowledge 
was limited, although the assistance of the insurance company doctors 
would be available. With the advice of several company doctors, a 
satisfactory code containing about 400 classifications was prepared. 

In order that the study of the relative frequencies of various surgical 
procedures be free of seasonal bias, it was necessary that the claims 
included in that portion of the study be chosen proportionately over a 
twelve months’ period. Several of the companies found it inconvenient 
to select claims in that way and so are participating only in the study 
of the relative charges made by doctors. 

In all, punched cards covering about 100,000 claims have been pre¬ 
pared by seven companies. The following information has been sub¬ 
mitted with respect to each surgical claim: (1) the code classification of 
the first surgical operation performed, (2) the classification of a second 
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operation which may have been performed at the same time, (3) the 
total number of procedures performed during the same operation, (4) 
whether the operation was performed in a hospital or elsewhere, (5) 
the claimant’s sex, including whether an employee or dependent, age 
and state of residence, (6) the total amount of the doctor’s charge, and 
(7) the amount of the insurance benefit payable. The data from the 
seven companies have been received by the compiling company and are 
in the process of analysis. The indications are that the study^ 'will pro¬ 
duce much information useful not only to insurance companies but also 
to others. 

In concluding, it should be emphasized that insurance against the 
cost of medical care is relatively new. Much is yet to be learned, es¬ 
pecially about plans insuring doctors’ charges for non-surgical services; 
these are very definitely in the formative, experimental stages. The in¬ 
surance companies are finding out what demand there is for various 
medical services, what it costs to provide such services and w’hat ad¬ 
ministrative problems are involved in covering those costs on a pre¬ 
payment basis. It is hoped that the statistics and experience of the in¬ 
surance companies will contribute valuable knowledge regarding the 
important and difficult problem of distributing and adequately paying 
for medical ser\dces in this coimtry. 

1 See "Group Surgical Claims Study, Report of the Committee to Prepare Mortality and Morbidity 
Studies on Group Insurance,* Tranaactima of the Actuarial Society of America, Vol. 49, 1948. 
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Observations on Cost Utility Rating in Relation to Correlation 
Joseph Berkson, Mayo Clinic 

"Use of Statistical Methods in Human Genetics.” A Test for Monomeric Inr 
heritance of a Normal Trait in Man 
Marianne E. Bernstein, Syracuse University 


—^3:30 p.M.— 

Operational Research and Other Military Applications op Statistics 
(Joint with the Institute of Mathematical Statistics, with the cooperation of 
the Operations Evaluation Group of the Navy and Operations Analysis 
of the Air Force) 
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Chairman: Merrill M. Flood, General Staff, United States Army 

Operations Analysis in the Southwest Pacific Air War 
Roger I. Wilkinson, Bell Telephone Laboratories 
Operations Analysis of Air-Sea Rescue 

E. S. Lamar, Operations Evaluation Group 
Factorial Chi-Square in Test Shooting 
A. E. Brandt, Naval Ordnance Laboratory 
Mathematical Techniques of Program Planning 

George Dantzig, Consultant to the Air Comptroller, Headquarters, United 
States Air Force 

Measures op Construction Activity 
Chairman: Paul Krueger, Bureau of the Budget 
A User’s View 

Leo Grebler, Private Consultant 
The Government’s Estimates of Residential Building 
H. E. Riley, Bureau of Labor Statistics 

Discussion: Howard Whipple Green, Cleveland Health Council 
W. A. Berridge, Metropolitan Life Insurance Co. 

Samuel Dennis, Department of Commerce 
W. G. Shaw, Department of Commerce 
Clyde Shute, F. W. Dodge, Inc. 

Institute op Mathematical Statistics—Business Meeting 

—^5:30 p.M.— 

American Statistical Association—^Annual Busienss Meeting 


—^8:00 p.M.— 

Presidential Addresses 

(Joint with the American Sociological Society) 

Willard L. Thorp, President, American Statistical Association 
Louis Wirth, President, America! Sociological Society 

Tuesdayy December SO, 1947 
—^9:00 A.M.— 

Housing 

(Joint with the American Sociological Society) 

Chairman: Howard Brunsman, Bureau of the Census 

An Appraisal Method for Measuring the Quality of Housing 

Allan A. Twichell, American Public Health Association Committee on the 
Hygiene of Housing 

Problems in Development and Application of Standards of Housing in Making 
Adequate Budgets 

Dorothy S. Brady, Bureau of Labor Statistics 
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Discussion: John P. Dean, Research Fellow, Social Science Research Council 
Nicholas J. Demerath, University of North Carolina 


—10:00 A.M.— 

Dbsckiptive Statistic in the Sebvige op Life Insitbance 

Chairman: Joseph B. MacLean formerly Vice President and Actuary of the 
Mutual Life Insurance Company of New York and past 
President of the Actuarial Society of America 

Life Insurance Investments and the Capital Markets 

Fergus J. McDiarmid, Lincoln National Life Insurance Company 
Mortality Statistics for Life Insurance Underwriting 

Edward A. Lew, Metropolitan Life Insurance Company 
Statistical Problems in New Fields of Insurance 

Morton D. Miller, Equitable Life Assurance Society 


—9:30 A.M.— 

Application of the Theoby op Extbeme Values 

(Joint with the Institute of Mathematical Statistics) 

Applications to Fracture Problems 

Benjamin Epstein, Carnegie Institute of Technology 
Applications to the Prediction of Flood Flows 
Emil Qumbel, Brooklyn College 
The Mathematical Theory of Extreme Valves 
Richard von Mises, Harvard University 
Applications to Meteorology 

Horace Norton, Weather Bureau 


—11:00 a.m.— 

Pbogbess Rbpobt and Discussion op Productivitt in Industbt 
Ernest Dale, Columbia University, American Management Association 
Katherine P. Ellickson, Congress of Industrial Organizations 
W. D. Evans, Bureau of Labor Statistics 
Martin Gainsbrugh, The Conference Board 
Thomas Mills, Bureau of the Budget 

Lazare Teper, International Ladies Garment Workers Union 


—I :00 p.m.— 


Outlook fob 1948 

Chairman: Willard L. Thorp, President, American Statistical Association 

Discussion: Martin Gainsbrugh, The Conference Board 

James Hughes, Auchincloss, Parker and Redpath 
Ragnar D. Naess, Naess and Commings 
Imrie de Vegh, Consulting Economist 


Biombtbics Section—^Business Meeting 
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Statistical Techniques in Life Insurance 

(Joint with the Institute of Mathematical Statistics) 

Chairman: Robert J. Myers, Social Security Administration 

Problems with Sampling Procedures for Reserve Valuations 
George C. Campbell, Metropolitan life Insurance Co. 

Sampling Errors in Life Insurance Mortality and Other Statistics 
Donald Cody, Equitable Life Assurance Society 
Recent Developments in Graduation and Interpolation 
T. N, E. GreviUe, National Office of Vital Statistics 


—3:30 P.M.— 

Census and Vital Statistics 

(Joint with the American Sociological Society) 

Chairman: Edmund de S. Brunner, Columbia University 

Sociologists* Suggestions Relating to the 1950 Census in the U,S. 

P. K. Whelpton, Scripps Foundation for Research in Population Problems 
Comments on Feasibility of the Sociologists* Suggestions 
A. Ross Eckler, Bureau of the Census 
Problems in Revision of the Birth and Death Certificates 
Felix Moore, National Office of Vital Statistics 
Discussion: T. Lynn Smith, Vanderbilt University 

Dorothy Swaine Thomas, Giannini Foundation 

The Use of Statistics by Business Management—A Rounb Table 

Chairman: Isador Lubin, United Nations Labor and Employment Commis¬ 
sion 

Robert W. Burgess, Western Electric Co. 

W. H. Garbade, Shell Oil Co. 

Richard O. Lang, S. C. Johnson & Son, Inc. 

Charles Roos, Econometric Institute 
Clarence Smith, International Business Machines 

American Statistical Association—Contributed Papers 

Factors in the Discontinuance of a Statistical Series 
Harry Alpert, Bureau of the Budget 

Testing for Grade and School Differences Among the Scores of Home Economics 
Students 

William D. Baten, Brooklyn College 
On Measuring Languages for Intematumal Use: On Pan Sampling 
Stuart C. Dodd, University of Washington Public Opinion Laboratory 
Sampling Errors and Components of Interviewer Costs in Survey Design 
Roe Goodman and Charles Cannell, Survey Research Center 
The Sugar Consumption Quota: A Problem in Administration of Legislative 
Controls 

Julius Guttenplan, Sucrest Corp. 

Income, Savings, and Mortgage Investment 
Mortimer Kaplan, Federal Housing Administration 
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Objective Selection of a Respondent within the Household 
Leslie Kish, Survey Research Center, University of Michigan 
Trends in Statistical Activities of Trade Associations 
Sidney Lieberman, Chart Facts 

Maximum Likelihood Estimates of Absolute Dispersion for Samples Drawn from 
Type III and Type V Populations 
Nilan Norris, Hunter College 

Institute op Mathematical Statistics—Contributed Papers 

The Biometric Society, Eastern North American Region—Business 
Meeting 

List of Committee Chairmen and Representatives for 1947 

Committee on Fellows: Joseph S. Davis 
Committee on Nominations: Woodlief Thomas 
Committee on (Graphics: Kenneth W. Haemer 
1947 Annual Meeting Program Committee: Samuel S. Wilks 
International Statistical Conferences ASA Program Committee: Willard L. Thorp, 
Philip M- Hauser, Acting Chairman 
Committee on Census Enumeration Areas: Howard W. Green 
Joint Committee on Occupational Classification: Gladys L. Palmer 
Census Advisory Committee: W. F. Ogburn 

Advisory Committee to BLS on Employment and Statistics Program: S. S. Wilks 
Advisory Committee to the National Housing Agency on Housing Statistics: Howard 
G, Brunsman 

Biometrics Section Committee: D. B. DeLury 
Temporary Committee on the Training of Statisticians: F. G. Cornell 
Representatives on the Committee for the Preparation of a Source Booh of Economic 
Statistics: Walter Mitchell, Jr., J. Frederic Dewhurst 
Representative on the Board of Directors of the National Bureau of Economic Re-> 
search: Frederick C. Mills 

Members of the Social Science Research Council: Joseph S. Davis, Harold Hotel¬ 
ling, Willard L. Thorp 

Representative on the Council of the American Association for the Advancement of 
Science: Frederick F. Stephan 

Representatives on the Joint Committee for the Development of Statistical Applicon 
tions in Engineering and Manufacturing: Churchill Eisenhart, Walter A. 
Shewhart 

RepreserUative on the Sectional Committee on Standards for Graphic Presentation of 
the American Standards Association Sponsored by the American Society of Me- 
chaniccd Engineers: Kenneth W. Haemer 
Representative on the Advisory Board of the American Year Book: Frank W. 
Notestein 

Representatives on the Intersocieiy Committee on Science Foundation Legislation: 

Lester S. Kellogg, Frederick F. Stephan 
Member of the American Documentation Institute: Lester S. Kellogg 



MINUTES OF THE ANNUAL BUSINESS MEETING 

The American Statistical Association convened for its 107th Annual Business 
Meeting at the Hotel Commodore, New York City, Main Ballroom, on Monday, 
December 29, 1947, President Thorp presiding. 

The minutes of the 106th Annual Business Meeting, held in Atlantic City on 
January 25, 1947 were approved. 

Aryness Joy Wickens, acting for Joseph Davis, Chairman of the Committee 
on Fellows, read the following citations of the FeDows elected for 1947: 

The Committee on Fellows has unanimously elected six members as Fellows 
of the American Statistical Association: 

John Hamilton Curtiss, Assistant to the Director of the National Bureau 
of Standards; contributor to statistical theory and to pure mathematics; 
teacher, administrator, and patriotic iimovator in the application of sta¬ 
tistical methods to problems of the Navy at war. 

Milton Gilbert, Chief of the National Income Division of the Bureau of 
Foreign and Domestic Commerce of the U. S. Department of Commerce, 
and formerly Editor of the Survey of Current Business. He has been re¬ 
sponsible for much of the improvement of statistics of gross national product 
and has contributed notably to the analysis of the national income. 

Morris H. Hansen and WUliam Hurwitz. U. S. Bureau of the Census, a dis¬ 
tinguished statistical team which, although among the younger group of 
outstanding members of this society, has made important contributions to 
the use of sampling techniques in the recurrent report of the Census Bureau 
on population and unemployment. They are joint developers of large-scale 
techniques whereby gamblers’ tools can serve solid citizens through efficient 
gathering of pertinent information. 

Jacob Marsckak, Besearch Director of the Cowles Commission and Pro¬ 
fessor of Economics at the University of Chicago; born in Russia, with 
academic degrees from Heidelberg and Oxford, he has made valuable per¬ 
sonal contributions and exercised constructive leadership in statistics, 
economics, and econometrics in England and in the United States. 

Herbert Marshall, Dominion Statistician of Canada, Chairman of United 
Nations Statistical Commission, noted for his responsible leadership in and 
important contributions to Canadian statistics, especially those on the 
Canadian international balance of payments and on Canadian-American 
direct investments; and for his services in the use of statistics in the control 
of his country’s economy during a great war. 

Lester S. Kellogg, Secretary of the Association, reported briefly on the current 
status of finances and membership of the Association. 

The following resolution, submitted by A. Ford Hinrichs, was read: 

RESOLVED that: 

1. The President of the American Statistical Association shall arrange in 
consultation with the President of the American Economic Association that 
the annual meetings in 1949 and later years shall be held at a common time 
and place. 

2. In order to contribute to a unified attack on common problenusi, other 
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societies of the social sciences should be invited to join in the arrangements 
for a common meeting place. 

3. The President of the American Statistical Association shall appoint a 
Committee not later than March, 1948, to arrange a program on the 
PROBLEMS OF THE STATISTICAL PROFESSION and of a unified 
attack on problems in the field of the social sciences, to constitute the core 
of the sessions of the annual meetings in 1949. The Committee shall have 
discretion as to the contents and arrangement of this program, but it is de¬ 
sirable that it should include a discussion of: 

a) professional standards and the professional requirements of various 
fields of work and types of occupations in which statisticians are com¬ 
monly engaged. 

b) training requirements and methods, with reference not only to profes¬ 
sional statisticians but also with reference to the elementary levels for 
citizenship and active participation in a democracy. 

c) kind of professional organization needed to encourage the development 
within the total profession of groups of specialized professional interests. 

d) methods of presenting the findings of the profession to its members and 
of presenting these findings and representing the profession to the public. 

e) proposals for the development of a better coordinated attack by the 
profession on unsolved problems of statistical theory, including 

i) the development of a moving 5-year program to establish the goals 
of the Association itself, 

ii) a continuing analysis in each field of theory of the gaps in the data 
or in professional knowledge, 

iii) arrangements to encourage research and analysis particularly at 
these critical points, 

iv) arrangements at the annual meetings to present a comprehensive 
brief view of all new advances to the profession, with special refer¬ 
ence to advances in statistical theory, in the measurement of relation¬ 
ships between variables, in data and in techniques of analysis. 

While this program shall constitute the core of the annual meeting in 
1949, it is not intended to exclude other types of discussion. 

4. The Committee so appointed shall invite the participation of the Social 
Science Research Council in its deliberations and program arrangements 
and shall consult with committees that may be appointed for similar 
purposes by other associations of the social sciences. As far as possible, 
joint programs shall be arranged with such other associations to deal with 
common or similar problems. 

This resolution was referred to the Board of Directors for its consideration. 
(A similar resolution was presented at the annual meeting of the American 
Economic Association and the American Sociological Society.) 

Willard L. Thorp, President reporting for the Committee on Structure, stated 
that the draft of the new Constitution was in process of revision and would be 
submitted to the members within a short time. He informed the membership 
that through an oversight of the Board of Directors a proposed amendment to 
the Constitution, approved by the membership in 1940, was never resubmitted 
to the membersMp because of the failure to hold annual meetings during the 
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recent world war. This report was noted as a matter of information and no 
additional vote was taken. 

In the absence of Francis G. Cornell, Chairman of the Temporary Committee 
on the Training of Statisticians, Helen M. Walker, a member of the Committee, 
reported that the Committee had organized several sessions at the Annual 
Meeting which had been very well attended. Over 70 people present at these 
sessions on training signed a petition for the formation of a Section on the 
Training of Statisticians, and Professor Walker announced that an organization 
meeting to form the Section would be held the following day. 

Samuel S. Wilks reported that a plan had been worked out for the publication 
of a new periodical to be sponsored by the Association, to be called “Statistical 
Reviews.” The publication will begin its appearance in 1949 and will carry book 
reviews, abstracts, and the Statistical Methodology Index which at present 
appears in the Journal. He reported that plans had been completed and were 
ready to submit to a foundation with a request for subsidy. 

Dr. Wilks also reported on the activities of the Committee to advise the Bureau 
of Labor Statistics on its employment statistics program. This Committee pre¬ 
pared a detailed report which included a number of useful recommendations and 
was submitted to the Commissioner of Labor Statistics in August of 1947. 
Aryness Joy Wickens, of the Bureau of Labor Statistics, expressed the thanks of 
the Bureau for the service rendered by the Committee. 

Willard L. Thorp presented the following slate of officers prepared by the 
Nominating Committee: 

George W. Snedecor, President; 

Frederick F. Stephan and Willard L. Thorp, Directors for the years 1948, 
1949 and 1950; 

A. Ford Hinrichs, Rensis Likert, Abraham Wald, and Harry R. Wellman, 
Vice-Presidents; 

Lester S. Kellogg, Secretary-Treasurer. 

The vote in favor of the slate was unanimous and the Secretary was instructed 
to cast one ballot for the entire list. 

Willard L. Thorp, outgoing President, turned the Chairmanship of the meeting 
over to the newly elected President, George W. Snedecor. Professor Snedecor 
expressed his pleasure at the honor conferred on him and his appreciation of the 
diligent and unremitting efforts of Lester S. Kellogg as Secretary of the Associ¬ 
ation. 

The following resolution 'was presented. It is RESOLVED that the Officers 
and the members of the Association express their appreciation to the New York 
Chapter for its help in making such excellent arrangements for the Annual 
Meeting. The thanks of the membership are particularly due to Waite S. Brush, 
District Representative of the New York Chapter and Chairman of the local 
Committee on Arrangements, for his effective assistance. 

It was also RESOLVED to commend Waite S. Brush for the remarkable ex¬ 
pansion of actmties in the New York area during the past year. 

Solomon Barkin presented a resolution stating that “The American Statistical 
Association should investigate its responsibilities in the held of industrial sta¬ 
tistics.” This resolution was referred to the Board of Directors for action. 

Waite S. Brush reported that there had been a tremendous increase in activities 
in the New York area. He reported that the Socifd Statistics Section and the Sta- 
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tistical Techniques Section of the Chapter had been meeting regularly, that a 
number of full Chapter Meetings had been organized and well-attended during 
the course of the past year. The Chapter has been occupied with the statistical 
problems of the New York area and has been working ^ ith municipal and state 
committees to determine statistical requirements in New York and in helping to 
plan means of making additional statistical data available. Mr. Brush appealed 
for help from the national Association in obtaining the services of a paid secretary 
and asked the attention of the President and the Board of Directors for his re¬ 
quest. 

The meeting was adjourned. 

Lester S. Kellogg, Secretary 
Report of the Board of Directors 

The year 1947 witnessed a continuing growth of the American Statistical 
Association and an expansion and ranuheation of its activities. The dues-paying 
membership increased from approximately 3,800 to about 4,800. Three new chap¬ 
ters were organized, in Honolulu, Ithaca and Oklahoma City. Negotiations were 
begun for the chartering of additional chapters, two of which were formally 
organized during the first quarter of 1948. 

During September 1947, many of the world^s leading statisticians met in 
Washington for the sessions of the International Statistical Conferences. Five 
hundred scientific workers in statistics and allied fields attended and all major 
countries, with the exception of the U.S.S.R., sent delegates. The groundwork 
was laid for the resumption of that fruitful international cooperation among 
scientists which characterized the pre-World War I and inter-war periods. 

The Association was one of the sponsoring organizations of this gathering. In 
addition to giving a reception at wMch the foreign delegates vrere able to renew 
social and intellectual contacts which had been broken during the War, the ASA 
organized three of the sessions of the Conferences. A special issue of the Journal 
was prepared for the international meetings and distributed gratis to the foreign 
delegates. A non-commercial graphics exhibit was organized to familiarize non- 
United States delegates with the presentation methods developed in this country 
over the past seven years. Through the cooperation of publishing firms and sta¬ 
tistical machine manufacturers, a commercial display was provided which gave 
comprehensive coverage of textbooks, publications on methodology and spe¬ 
cialized fields and mechanical aids to statistical work, ranging from simple com¬ 
puting machines to highly advanced and specialized types. The exhibit produced 
a net income of $3,286.20, which was made available to the Joint Arrangements 
Committee to help in defraying expense incident to conducting the Conferences. 

Recognizing that physical destruction of foreign libraries, doUar exchange 
shortages and other factors have conspired to deprive foreign statisticians and 
statistical institutions of the tools necessary to their work, the Association made 
available complete sets of the ASA Journal for the past 15 years gratis to approxi¬ 
mately 25 war-devastated libraries. Current books on statistical developments 
and methodology were also made available to needy foreign statisticians, scholars 
and schools. 

The International Statistical Conferences in Washington resulted in the or¬ 
ganization of several new specialized groups of statisticians and persons working 
in closely related fields. International groups were launched to study national in- 
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come and wealth, public opinion sampling, biometrics and demography. In all 
these new ventures, members of the Association have played an active role. 

The papers delivered at the Conferences are to be published in five volumes. 
The Association is editing the contributions made to the three sessions which it 
organized. Noteworthy papers of special interest to the membership will be 
published separately in forthcoming issues of the Journal, 

In August 1947, the first issue of The American Statistician appeared.This 
bi-monthly periodical, which supplants the organizational Bulletin of the Associ¬ 
ation, was planned as a professional news magazine. It carries brief reports of 
important statistical news, discussions of professional problems, and articles on 
the activities of statisticians. The magazine is designed to reach not only the 
members of the Association but that large group of students and users of sta¬ 
tistics who are interested in the activities of the profession. Its circulation at the 
close of 1947 was around 5,400. 

The 107th Meeting of the Association was held in New York City at the 
Hotel Commodore in December 1947 in conjunction with the American Socio¬ 
logical Society, the Biometric Society and the Institute of Mathematical Sta¬ 
tistics. Despite the New York blizzard and the meetings of the American Eco¬ 
nomic Association and of Section K of the AAAS simultaneously in Chicago, the 
New York meetings drew an attendance of over 1,000 persons. A feature of the 
New York meeting was the comprehensive display of statistical presentation 
methods prepared for the Association by its Committee on Graphics under the 
chairmanship of Kenneth W. Haemer. 

Further discussion of the new Constitution of the Association occurred at the 
December meeting in New York. The document discussed at that meeting was 
sent back for redrafting. This redraft was accepted, by mail ballot of the member¬ 
ship, in early 1948. 

Plans for a joint Annual Meeting between the American Economic Association 
and the ASA in Cleveland in the last week of December 1948 have been endorsed 
by both organizations. Tentative arrangements for a joint annual meeting on the 
West Coast in 1949 or 1950 are currently under consideration. 

The central of^ce of the Association in Washington acquired additional space 
and staff in 1947 to meet the requirements of continuing ASA growth. At the 
close of 1947, Lester S. Kellogg, secretary-treasurer of the Association during the 
period of rapid expansion of the war and post-war years, accepted an executive 
position with Deere and Company in Moline, Illinois. In view of his absence 
from Washington, Sylvia C. Weyl was named Acting Secretary-Treasurer on an 
ad interim basis and a committee was appointed to study the question of finding 
a successor to Dr. Kellogg. 

The development of the statistical field has required increasing activity and 
responsibility on the part of the Association. There is a demand for more publica¬ 
tions, more meetings with other professional groups, more professional represen¬ 
tation, an employment service and more committee activity. These activities 
require more resources and an expanded secretarial staff. Costs of printing and 
other services have about doubled since the end of the war. The effects of infia- 
tion have been partly offset by the growth in membership and subscriptions and 
the contributed secretarial and editorial services. The old membership rate of 
$5.00 a year was found to be inadequate to cover basic expenses much less to 
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permit expanded activity. In view of this situation the membership was asked 
to increase the rate to $8.00—^which was voted in February 1948. 

The Biometrics Section of the ASA participated actively in the International 
Conferences and in the organization of The Biometric Society, the new interna¬ 
tional organization of biometricians. The publication Biometrics completed its 
first year as a quarterly and was requested to print the Proceedings of the First 
International Biometric Conference. 

The Temporary Committee on the Training of Statisticians, under the chair¬ 
manship of Francis G. Cornell, made a report to the Board. As a result of this 
report, a Section on the Training of Statisticians was organized within the ASA 
and is charged with the formulation of standards for teaching statistics and vvith 
the preparation of special sessions at the Annual Meeting in 1948. 

The Association has been actively involved in preparations for the 17th 
Decennial Census of the United States. The Committee to Advise the Bureau 
of the Census, which has the responsibility of reviewing all requests from the 
general public for the inclusion of additional information in census data, has been 
working as a clearing house between statisticians and government to determine 
the content of census questionnaires. 

The Advisory Committee to the Bureau of Labor Statistics on the Employ¬ 
ment Statistics Program submitted a comprehensive and final report in 1947 
which was summarized in the April 1948 issue of The American Statistician, 

The year has been one in which the Association has grown, its responsibilities 
have increased and the services it renders to the statistical profession have ex¬ 
panded. The influence of ASA publications has been notably enhanced. Thus, the 
circulation of the Journal gained by approximately 20 per cent to reach 5,895. 
The American Statistician is reaching a broad group of less technically and mathe¬ 
matically minded readers, familiarizing them with general developments in the 
statistical field. In recognition of the probability that the Association must in the 
coming years provide still broader services, a decision has been taken to launch a 
new publication, under the editorship of Oscar K. Buros, probably in 1949, This 
new organ will cover the field of book reviews, will provide an index of statistical 
methodology, and will make available a comprehensive digest of all significant 
articles, both domestic and foreign, which deal with statistical problems. The in¬ 
creasing reliance of science, industry, business and government on statistical 
method, coupled with the continuing development of the science of statistics 
itself, confronts your Board with the problems incident to endeavoring to meet 
the unsatisfied needs of a corps of workers in an expanding science. 

Lester S. Kellogg 
ISAJIOB LtTBIN 
Lowell J. Reed 
Walter A. Shewhart 
Samttel A. Stouffer 
Willard L. Thorp 
Helen M. Walker 
Samuel S. Wilks 
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The Secretary'e Report on Membership 

During 1947 membership in the American Statistical Association increased by 
1,028, bringing the total on December 31, 1947 to 4,871. Three new corporate 
members joined the Association. The continued growth during the past year 
indicates the growing interest in the Association’s activities and a large potential 
membership. 

The 1947 membership is composed of the following groups: 


Honorary members. 15 

Fellows. 142 

Regular members. 4,714 


Total Membership. 4,871 

Corporate Members. 6 


There were 82 resignations in the course of the year, and 189 members were 
removed from the rolls for failure to pay dues. 

The members of the Biometrics Section, at the end of December, 1947, num¬ 
bered 1,060, of which 262 Trere associate members and 798 were also members of 
the ASA. 

The deaths of the following members were recorded during the year: Irving 
Fisher, Fellow; Charles A. Bauer, L. A. Beers, H. M. Biggert, Raymond Car¬ 
penter, Charles Cresswell, Kenneth Duncan, Albert M. Freeman, Thomas B. 
Graham, Otto Jeidels, James E. Kelley, Sara A. Kerr, H. G. MacCall, Joseph 
P. McEJroy, Hans NyboUe, Henry A. Ruger, James G. Smith, Philip Sweet, 
Henry A. Van Pelt, Regular Members, 

Lester S. Kellogg, Secretary 
Report of the Treasurer 

The 1947 budget as approved by the Board of Directors planned for an income 
of 832,535.00 and expenditure of $35,192.00. At the end of the year actual income 
was $37,014.87, and our expenditures came to $42,134.72. 

The increase in income has come largely from new memberships, increases in 
subscriptions and additional advertising in the Journal, and was substantial 
enough to permit the publication of a greatly enlarged December issue of the 
Journal. Income from Biometrics has increased, and it is expected that its income 
for the next few years will continue to expand as its service becomes more widely 
recognized. 

The national office is planning a new drive for memberships and subscriptions 
this year, and the budget will consequently be substantially increased. 

Lester S. Kellogg, Treasurer 
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Report of the Auditors 

To the Board of Directors of 

American Statistical Association 

We have examined the attached financial statements of American Statistical 
Association relating to the year ended December 31, 1947. Our examination was 
made in accordance with generally accepted auditing standards, and included 
such tests of the accounting records and other supporting evidence and such other 
procedures as we considered necessary in the circumstances. 

The recorded cash receipts for the year were traced to the deposits shown on 
the bank statements and the amounts for dues and subscriptions were tested with 
the membership and subscription records. The paid checks were inspected and 
related vouchers tested in support of cash disbursements for the year. The bank 
balances were reconciled with amounts reported direct to us by the depositaries 
and the cash on hand and the securities owned at December 31, 1947 were veri¬ 
fied by inspection. We did not check the membership and subscription records in 
detail or make any independent verification of the inventory of old Journals, the 
office records of which are based, in part, on data assembled in prior years, no 
recent physical inventory having been taken. 

The life membership reserve is computed on the basis of the combined annuity 
table of mortality with assumed interest at 4% per annum and an assumed an¬ 
nuity of $5.00 per life member, in accordance with a resolution of the Board of 
Directors on March 31, 1936. The amount treated as income in each year repre¬ 
sents the excess of the reserve at the beginning of the year plus interest for the 
year and new life membership receipts over the required reserve at the end of 
the year. 

In our opinion, the accompanying statements present fairly the position of 
American Statistical Association at December 31, 1947 and the results of its 
operations for the year, in conformity with generally accepted accounting prin¬ 
ciples applied on a basis consiatent in all material respects with that of the pre¬ 
ceding year. 


Price, Waterhouse & Co. 



American Statistical Association 
Balance Sheet 


December 31, 

Assets mr me 

Cash in bank and on hand. SI, 853.91 $12,475.91 

Accounts receivable. 1,356.22 774.19 

Investments: 

United States Savings Bonds, Series D, at redemp¬ 
tion value. 6,138.00 5,910.00 

Stocks, at cost (at market quotations $6,150 and 

$6,358, respectively). 5,793.50 5,793.50 

Inventory of old Journals, at approximate cost. 2,024.63 2,158.04 

Furniture and equipment, at cost less depreciation... 2,482.76 847.11 


$19,649.02 $27,958.75 


Liabilities 

Accounts payable. $ 6,515.76 $ 3,537.45 


Deferred income (collections applicable to subse¬ 
quent year): 

Dues. $ 1,054.50 $ 7,260.67 

Subscriptions. 2,704.76 3,158.48 

Other. 115.35 


$ 3,874.61 $10,419.15 

Net worth: 

Life membership reserve. $ 3,750.20 $ 3,377.35 

Centenary Sustaining Fund. 6,376.01 

Surplus, per statement. 5,508.45 4,248.79 


$ 9,258.65 $14,002.15 
$19,649.02 $ 27,958.75 


American Statistical Association 
Surplus Statement 

Year ending December 31, 

1H7 me 

Balance at beginning of year. $ 4,248.79 $ 4,594.20 

Add —^Transfer of balance in Centenary Sustaining 
Fund. 6,379.51 


$10,628.30 $ 4,594.20 

Deduct —Excess of expenses over income for the 
year, per income statement. 5,119.85 345.41 


Balance at end of year. $ 5,508.45 $ 4,248.79 
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American Statistical Association 
Income Statement 

Year ending December SI, 
1947 1946 

Income: 

Dues—current year. $21,298.58 $17,452.50 

Dues—^prior j-ears. 215.00 92.00 

Life memberships. 397.33 352.47 

Subscriptions. 6,206.95 5,341.30 

Advertising. 2,010.08 1,700.88 

Reprints. 509.00 530.93 

Journal sales. 2,827.80 793.80 

Biometrics Section income. 2,466.86 2,121.97 

Miscellaneous. 684.70 394.71 

Dividends and interest (after deducting $135.09 
in 1947 apportioned to life membership re¬ 
serve). 398.57 432.08 


$37,014.87 $29,212.64 


Expenses: 

Journal —^printing, mailing and reprints. $10,105.81 $ 7,908.35 

Bulletin. 4,259.88 991.01 

Salaries and wages. 17,037.26 11,876.44 

Rent. 1,365.00 1,020.00 

Office supplies, printing and mimeographing.... 1,764.42 1,277.44 

Postage. 1,511.11 1,060.07 

Telephone and telegraph. 512.03 369.30 

Travel expense—officers. 603.91 287.29 

Biometrics Section expenses. 2,366.46 2,185.81 

Membership Directory expense. 1,575.76 

Storage of old Journals . 72.00 72.00 

Cost of old Journals sold. 382.27 58.69 

Depreciation of furniture and equipment. 324.99 149.49 

Annual meeting expense (net). 647.89 238.84 

Loss on disposal of equipment. 151.09 

Miscellaneous. 1,030.60 487.56 


$42,134.72 $29,558.05 


Balance, loss, carried to surplus. $(5,119.85) $ (345.41) 
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Edited by 

OSCAE Krisen Buros 

Rutgers University 

Statistical Methods in Research and Production: With Special Reference to 
the Chemical Industry. Edited by Owen L. Dames. Written by Lewis R. Connor, 
Wilfred R. Cousins, Owen L. Davies, Francis R. Himsworth, Harold Kenney, 
Wilfred L. Stevens and Charles C. Tanner. (Chemists, Engineers, and Statisticians 
emploj'ed by Imperial Chemical Industries Ltd., Manchester, England.) Pub¬ 
lished for the Imperial Chemical Industries Limited. Edinburgh 1, Scotland: 
Oliver and Boyd Ltd. (Tweeddale Court), 1947. Pp. xii, 292. 28s. 

Review by D. B. DeLurt 
epartment of Mathematical Statistics 

AND 

A. E. R. "Westman 
Director, Department of Chemistry 
Ontario Research Foundation, Toronto S, Canada 

T he object of the authors is to put into the hands of the industrial research 
worker a handbook prepared especially for him in terms with which he is 
likely to be familiar. To this end, the main body of the discussion is given 
in verbal and arithmetical terms. Whatever algebraic developments are 
needed appear in appendices at the ends of chapters. This is not to imply 
that the book is chiefly a guide to effective computational methods.Emphasis 
is placed on the experimental situation and much careful and thoughtful 
explanation accompanies each example. 

The range of topics covered is indicated by the chapter headings: Fre¬ 
quency Distributions, Averages and Pleasures of Dispersion, Tests of Sig- 
niflcance, Analysis of Variance, Regression and Correlation, Frequency 
Data and Contingency Tables, Sampling, Control Charts, Prediction and 
Specification. 

Each chapter starts with a general description of the questions with which 
it deals, followed by examples based on data (usually drawn from the 
chemical industry) which are intended to illustrate the type of situation with 
which the chapter deals. The discussion of the examples is not obscured by 
arithmetical details; systematic methods of computation are dealt with in 
chapter appendices. 

This plan is, no doubt, effective, but starting as it does with the data 
rather than with the gathering of the data it tends to cover up the importance 
of statistical considerations in the planning of the experiment. Perhaps each 
chapter should include an example in which statistical techniques can ex¬ 
tract no worth-while information from the data! 

Those who are interested chiefly in practical applications will find much 
useful information gathered together in this handbook. On the whole it is 
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clearly presented and illustrated by good examples. The reader who has a 
knowledge of theoretical statistics will doubtless find many points on which 
he does not agree with the authors and others may find some of the discus¬ 
sions confusing. Certainly the quality of the book is uneven with respect to 
both clarity of exposition and accuracy of statement. While many of the 
discussions and explanations are excellent, others are obscure, sometimes 
inaccurate and occasionally meaningless. For example, it is not likely that 
anyone will be helped to a clearer understanding of statistical estimation by the 
statement that “Statistical estimation consists, in general, of postulating that 
a given set of data or sample was drawn from a Universe of a certain form. 
The constants or parameters required to define the Universe may be estimated 
from the data or may be postulated” (p. 14). 

Owing, perhaps, to reluctance of the authors to expand the book to greater 
length, some topics are treated too briefly. Only one model for the analysis 
of variance is explained, that in which differences among cell averages are 
regarded as arising entirely from random sources. The case of “fixed effects” 
is not discussed, although these methods are used to exhibit, for example, 
differences among three “gaugers,” who could hardly be considered a ran¬ 
domly chosen sample from a population of gaugers, in a test of the com¬ 
pressive strength of Portland cement (Example 5.5), and the difference be¬ 
tween two operators who carried out simultaneous determinations of the 
percentage of ammonia in a plant gas (Appendix 4B). 

The chapter on regression suffers considerably from the failure to discuss 
the underlying statistical model. The conditional nature of the regression 
l)roblem never emerges. Instead, there is created a hierarchy of regression 
problems, the two extremes of which are, apparently, (it) the case in which 
the quantities observed are inherently variable, in a statistical sense; (ii) the 
case in which the quantities observed are not statistical variables; the only 
random variation in this case comes from errors of measurement. This is 
described as the case in which the variables are related by an exact mathe¬ 
matical formula. 

The choice of examples to typify these two extremes is unfortunate. Case 
(t) is discussed in terms of the relationship between hardness and abrasion 
loss in rubber. The observations on these two variables are made on samples 
of rubber in which the amounts and t 5 pes of ingredients are deliberately 
changed from one sample to another, a situation which would seem to 
require the use of multiple regression methods. Case (w) is illustrated by the 
dependence on concentration of the response to a penicillin solution. It may 
be granted that the concentrations are known without appreciable error, but 
the variations in response represent genuine biological variation and it is 
diflicult to envisage an “exact” relationship between response and concen¬ 
tration. In any case, the differences between these two cases largely disap¬ 
pear when the true nature of regression is recognized. 

The distinction between these types is maintained throughout, culminat¬ 
ing in two different formulae for the standard error of a prediction based on 



320 


AMERICAN STATISTICAL ASSOCIATION 


a regression equation (Formulae 6.3 and 6.31). It is not clear what is being 
predicted, a mean value or the value of a future observation, but whichever 
is intended, the same formula (6.3 or 6.31) should apply in both case (0 and 
case (w)- 

More serious is the failure to bring together regression methods and the 
analysis of variance, so as to present the elements of the analysis of covari¬ 
ance, a technique second in importance only to the analysis of variance in all 
fields of experimentation. 

Inevitably a few slips must creep into every book. The following may be 
noted, 

(1) The F test is introduced in a comparison of variance estimates from 
two independent samples, a situation in which a two-tail test usually is indi¬ 
cated. However, the test employed is one-tailed (Example 4.51). 

(2) Throughout the discussion of the errors of prediction which result from 
the errors of the regression coefficients (p. 130), a should be replaced by % 

Readers interested in control chart techniques will find a clearly written 
chapter on the subject based on British Standard practice.^ However, only 
the charts for average and range are discussed in detail and the unfortunate 
impression is given by some parts of the discussion and by the numerical 
example that a subgroup of four or five is necessarily “rational.” In the 
chapters concerned with other techniques, not much attention is paid to 
the quality control approach to industrial problems and readers familiar 
with developments in America will consider the following statement on page 
4 of the Introduction to be overcautious: “Its application in the wider field 
of the chemical industry is still a matter of some doubt, but in view of its 
striking success in other fields, it is felt that the possibility of its application 
to chemical plants should receive careful consideration.” 

The book, as it stands, should be most useful to the industrial investigator. 
The intention of the authors, stated in the preface, to keep the handbook up 
to date, will furnish an opportunity for the critical editorial overhauling 
which is needed to bring this work to a uniformly high level of excellence. A 
sequel on the planning of experiments is contemplated. 

The typography is exceptionally good. Misprints are rare and unim¬ 
portant. 

* Review by E. L. Lehmann 

Assistant Professor of Mathematics 
University of California, Berkeley, California 

A TJTHORS of any book such as the present manual who, without assuming 
xXany previous statistical knowledge on the part of their readers, want to 
give an introduction to intelligent use of statistical methods are concerned 
with three different aspects of statistics, (a) They must explain the basic 
concepts such as probability, expectation, statistical hypothesis, power of a 

1 Dudding, B P., and Jennett, W. J. QtiolUu Corardl CharU. British Standard 600B: 1942. London: 
Bzituh Standards Institution, 1942. P. 89. 
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test, confidence intervals, etc. (5) They must describe the most important 
statistical procedures, state the conditions under which these procedures are 
known to be valid, and discuss some of the computational problems that 
arise, (c) They must illustrate the methods with representative examples 
from the field of application with which they are concerned. 

In the present volume, as in most other books of this kind, (b) and (c) 
are emphasized at the expense of (a) and even in the discussion of particular 
procedures the underlying assumptions are frequently indicated only rather 
vaguely. This seems regrettable since, as Professor Wolfowitz pointed out in 
a recent review in this Journal (41: 127-8 Mr ’46), “misapplication of 
statistical methods is more likely to be serious than the use of inefficient 
computing techniques.” In the discussion of hypothesis testing, for example, 
no mention is made of possible alternatives or, therefore, of power. As a 
result the assumptions underlying a test are never quite clear. For instance, 
when the authors explain their recommendation of a two-tailed F-test in 
analysis of variance, the class of alternatives which they have in mind seems 
to change during the discussion. 

The material covered in the book is fairly conventional. Two introductory 
chapters on frequency distributions, and averages and measures of dispersion 
are followed by a discussion of tests of significance (t-test and F-test—^the 
authors make the common slip of using a one-tailed F-table for a two-taUed 
test without adjusting the level of significance). Then come the two main 
chapters on analysis of variance, and regression and correlation, and chapters 
on frequency data and contingency tables, sampling, and control charts. At 
the end are a few pages on prediction and specification, which are mainly 
concerned with the Camp-Meidell and Tchebycheff inequalities. 

There is a somewhat incomplete glossary of statistical terms and an un¬ 
usually comprehensive collection of tables (area under normal curve, per¬ 
centage points of x®» t and F; values of correlation coefficient for different 
levels of significance; limits of the expectation for binomial and Poisson 
distributions; and control chart limits for average and range). A nice feature 
of the book is that the methods are illustrated throughout with many inter¬ 
esting examples from the chemical industry. 

The authors state that they intend to keep the handbook up to date and 
indicate that they hope to include some sequential techniques in a later 
edition. It would perhaps be useful also to include some nonparametric 
methods. 


Selected Techniques of Statistical Analysis: For Scientific and Industrial Re¬ 
search and Production and Management Engineering. By the Statistical 
Research Group, Columbia University, Applied Mathematics Panel, Office^ of 
Scientific Research and Development. Effited by Churchill Eisenhart (Chief, 
Statistical Engineering Laboratory, National Applied Mathematics Laboratories, 
National Bureau of Standards, Washington, D. C.); Millard W. Hastay (Re¬ 
search Staff, National Bureau of Economic Research, New York 23, N. Y.); and 
W. AUen WaUia (Professor of Statistics and Business Economics, IJniversity of 
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Chicago, Chicago 37, Ill.). New York 18: McGraw-Hill Book Co., Inc. (330 West 
42nd St.), 1947. Pp. xiv, 473. $6.00. (London W.C. 2: McGraw-Hill Publishing 
Co., Ltd. [Aldwych House, Aldwych], 1948. 36s.) 


Review by Henry ScedeffiS 

Associate Professor of Engineering, University of California 
Los Angeles, California 

T his book consists of 17 quite unrelated chapters dealing with various 
problems which arose in projects undertaken by SRGC. Almost anyone 
confronted with practical problems calling for statistical methods may be 
assured of finding in this book material of value to him, providing he has 
some background of statistical knowledge, perhaps the amount acquired 
in two year courses in statistics. A very thorough index of 43 pages of fine 
print increases the usefulness of the book. It would not be suitable for a text¬ 
book for various reasons, including the specialization of the subject matter. 

Since a summarj^ of the book by chapters would take too much room here 
we shall list the titles and number of pages for each chapter, and add a few 
remarks. Part I, Industrial Statistics, consists of chapters 1 through 6; 
Part II, Planning Experiments, of chapters 7 through 13; and Part III, 
Techniques and Tables, of chapters 14 through 17. The chapters are: 


1. Use of variables in acceptance inspection for per cent defective (92 pp.). 

2. Tolerance limits for normal distributions (16 pp.). 

3. Multivariate quality control, illustrated by the air testing of sample 
bombsights (74 pp.). 

4. Effects of rounding or grouping data (40 pp.). 

5. Acceptance inspection when lot quality is measured by the range 

(10 pp.). 

6. Variability of amount of inspection for double, multiple, and sequential 
sampling (10 pp.). 

7. Planning and analyzing experiments for comparing two percentages 




8. Planning and interpreting experiments for comparing two standard 
deviations (52 pp.). 

9. Utilization of limited experimental facilities when the cost of each 
measurement depends on its magnitude (10 pp.). 

10. Experiments in which the independent variable is known only in 
terms of a probability distribution (10 pp.). 

11. Planning an experiment for estimating the mean and standard devia¬ 
tion of a normal dmribution from observations on the cumulative dis¬ 
tribution (14 p]^.). 

12. Abandoning an experiment prior to completion (10 pp.)* 

13. Planning experiments seeking maxima (10 pp.). 

14. Probability that sample means are in opposite order to population 
means (8 pp.). 

15. Significance of the largest of a set of sample estimates of variance 
(12 pp.). 

16. Inverse sine transformation of proportions (22 pp.). 

17. Elements of sequential analysis (13 pp.). 


The chapters were written by Albert H. Bowker, Churchill Eisenbart, 
Milton Friedman, Millard W. Hastay, Harold Hotelling, Edward Paulson, 
L. J. Savage, Herbert Solomon, and W. AUen Wallis. Some of the chapters 
are based on work credited to the following people not included in the above 
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list of writers: Keniieth J. Arnold, M. A. Girshick, Myra Levine, A. Wald, 
and J. Wolfowitz. 

Among the nonuniform aspects of the chapters is the mathematical prep¬ 
aration assumed on the part of the reader: At one extreme we find 7 pages 
devoted to computational methods for the evaluation of x+ks, where x 
is the sample mean, s, the sample standard deviation, and k, a given constant 
(this may be justified by the resistance to the use of s in certain industrial 
quarters), at the other extreme there is mention of a set of measure zero 
(in a “technical note”)* Mostly the authors have conscientiously tried to 
meet the needs of the “practical man” for whom the book was written, and 
if he happens to be also a mathematician he may occasionally find somewhat 
painful* the verbal arguments where he would use symbolic. If unifying 
motifs are sought in this work, one is to be found in the wide use of the 
concept of the power function or operating characteristic of a statistical 
test; this occurs in chapters 1, 5, 6, 7, 8, 12, 14, and 17. The reviewer con¬ 
siders one of the main virtues of the book to be the influence it will have in 
making this concept a tool of the practitioner. 

The book brings to the reader a large amount of material which has 
been available before only in periodicals. Not even thus available has been 
much of the content of chapter 3 on multivariate quality control. Aside from 
the novelty of applying the retype of statistic to quality control problems, 
we note also the following innovations: The distance apart of the control 
lines on the control charts depends on the length of the base period from 
which they are calculated (the distance decreasing with increasing length). 
“Yardsticks” are introduced on the control charts which allow us to judge 
whether the difference of the ordinates of two points is statistically signifi¬ 
cant. Where the ratio rather than the difference is statistically appropriate, 
the yardstick device is retained by use of a logarithmic vertical scale. 

It is clear that this group did some very fine statistical work. For their 
efforts in rescuing some of the results of permanent value from burial in the 
files the director of the group, Professor Wallis, and the other editors deserve 
congratulations. 


The Design of Experiments, Fourth Edition, Ronald A. Fisher (Arthur Balfour 
Professor, University of Cambridge, Cambridge, England). Edinburgh 1, Scot¬ 
land: Oliver and Boyd Ltd. (Tweeddale Court), 1947. Pp. xi, 240. 12s. 6d. 

Kbview by a. E. Brandt 
Statistical Consultard, Office of Technical Director 
Naval Ordnance Laboratory, Washington, D. C. 

T he reviewer has enjoyed the opportunity to appraise the fourth edition 
of The Design of Experiments after a number of years of teaching and 

* Also that the one formally stated theorem in the book (p. 409) is possibly false, the proof cer¬ 
tainly is, and in the two applications to discrete distributions the conditions of the theorem areobvbu^ 
not satisfied [condition (14)]. Among the errors in the proof is the unjustified equating of E(XF} to 
E(X)jB(T) in several places. A correct theorem of the sort attempted, namely, one that is easily and 
widdy applicable, would be nice to have. 
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applying the principles of design presented. These principles were first re¬ 
ceived by the reviewer in a course of lectures previous to publication of the 
first edition. Though each previous edition has been read and studied, the 
re\iewer was well rewarded for the time spent on the latest one. 

Familiarity with the statistical procedures presented by the author in his 
Statistical Methods for Research Workers is taken for granted as a background 
for understanding this book. However, the pleasing literary style and non- 
mathematical treatment in all but the last chapter have led some persons 
who lack this background to expect great benefits from reading it. Those 
readers who complain that the book is hard to understand are admitting 
their own shortcomings. 

The course of lectures given by the author previous to the publication of 
the first edition of this book was called The Logic of Experimentation. The 
first two chapters and, to a large extent, the third, are more aptly described 
by this title than by the title of the book. In the first chapter the author 
presents a convincing argument for his theory of inductive inference and 
his reasons for rejecting inverse probability. By means of his classic experi¬ 
ment relating to a lady’s claim that she can determine by taste whether the 
milk or the tea has been placed in a cup first, the whole logical structure of 
critical experimentation, from the statement of the question to the conclusion 
to be drawn from the experimental results, is developed. The roles of ran¬ 
domization and repetition or replication and the necessity of their inclusion 
in a critical experiment are made clear. 

The data on Zea mays from Darwin’s historical experiment on growth rate 
and Galton’s interpretation of them are used to show how fallacious uses of 
statistics lead to discordant results and as a warning against the manipula¬ 
tion of data. The effect of local control, such as pairing or grouping, on the 
sensitiveness of each individual observation is clearly shown. The general 
test of the hypothesis that two samples are random samples from identical 
populations presented in the last section of the third chapter merits greater 
study by statisticians who lack experience in experimentation in some field 
of science such as biology, medicine, or physics. 

To give a clear understanding of the logic of experimentation presented in 
the first three chapters and its practical application, the author uses a num¬ 
ber of examples in chapters 4 to 9 inclusive. In the fourth and fifth chapters, 
simple situations are presented which can be handled by means of random¬ 
ized block or latin and higher square designs which are the simple or basic 
types. In the next four chapters, designs suitable for situations of greater and 
greater complexity are illustrated. The advantages of the factorial arrange¬ 
ment over a series of experiments in which but one factor is varied at a time 
are emphasized in chapter 6. The use of local controls such as confounding 
and partial confounding and of statistical control through concomitant 
measurements are dealt with rather fully in chapters 7, 8, and 9. 

Chapter 10 on the generalization of the null hypothesis and fiducial prob¬ 
ability contains much of interest to the practical experimenter but is es- 
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pecially important for theoretical statisticians. The final chapter on the 
measurement of information is especially commended for serious study, not 
because of the calculus involved, but because the principles in it are ignored 
so frequently. 

Though the author has used the results of actual experiments to illustrate 
the requirements of sound and intelligible experimental designs and means of 
drawing valid conclusions from them, he has avoided producing a compen¬ 
dium of designs which have been used successfully in various fields of ex¬ 
perimentation. It would seem, however, that there is a distinct need for 
such a book. 

Das Harmoniegesetz der Statistik: Eine Untersuchimg fiber die metrische 
Interdependenz der sozialen Erscheimmgen. [The Law of Harmony in Sta¬ 
tistics: An Investigation of the Metrical Interdependence of Social Phenomena.] 
L. 7. Furlan (Professor at the University of Basel). Basel, Switzerland: Verlag 
fur Recht und Gesellschaft A.-G., 1946. Pp. xii, 504. 

Review bt Hilda Geiringer 
Professor of Mathematics, Wheaton College 
Norton, Massachusetts 

T his book presents an investigation on statistics rather than a statistical 
investigation. As the author asserts, the starting point is basically new. 
The purpose is to study great masses of statistical numbers without any 
consideration of their meaning but with respect to some “number-theoreti¬ 
cal” properties. The result is a very definite statement concerning a very 
vaguely defined subject: Consider a mass of statistical numbers, “the way 
they are found in every statistical yearbook.” They form together some kind 
of refiection of the social life of a group of people, of a nation, or of groups of 
nations. Such a mass of statistical numbers, no matter whether it consists 
of only a few or of very many numbers, represents a sample of the infinite 
mass of numbers which describe the social phenomenon, or rather some as¬ 
pects of the social phenomenon. As an example a “collective” consisting of 
1.19X10* numbers is considered (quoted from Statist. Handhuch der WeU- 
wirtschaft, Berlin, 1936), containing data on various fields of economical 
life for eighty countries for the year 1933. (This collective is analyzed as a 
whole and also resolved into 80 partial collectives.) Another example con¬ 
tains an even larger mass of numbers, practically all numbers contained in 
the Statist. Jahrhuch der Schweitz, 1912. (In each of these and other “natural” 
collectives certain adjustments are made before the analysis of the material 
is started.) 

The main problem and aim of this analysis is a very strange one. The first 
figure in a natural number is called its “indicant.” If decimal fractions are 
considered, the first nonzero number is the “indicant.” In a decadic system 
there are nine possible indicants. For each of these nine indicants, their 
relative frequency within the considered collective of numbers is computed. 
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The author’s contention is that, quite independently of the meaning of these 
statistical numbers, if the “natural collective” is only large enough and 
somehow a random collective, there exists a striking regularity concerning 
these nine relative frequencies. (These nine frequencies form the “spectrum” 
of the natural collective.) The statement is that for a natural collective the 
relative frequency of the indicant i, (i=l, 2, .. ., 9) equals log (l+l/i). 
If this is so, i.e., if the relative frequency of the indicant (i) has this value, 
the spectrum is called the “harmonic spectrum,” and this validity of the 
harmonic spectrum for natural collectives constitutes the harmony law of 
statistics. A particular consequence of this law is the fact that the indicant 
“one” appears with relative frequency log 2 =.30; hence, it appears much 
more frequently than it would in a uniform distribution of the nine numbers. 
This is called “concentration on the one” and is considered as the most 
striking and the most important feature of the law. The frequency of the 
numbers 1, 2, . .. , 9 then decreases according to this law. This “harmony 
law” which forms the main result of the 500-page work may be stated once 
more: “The spectrum characteristic of natural collectives is the harmonic 
spectrum” (p. 443). Or, in the more specific form: “The indicant of a statis¬ 
tical number is a chance variable which assumes the values { = 1, 2, . . . , 9 
with probability log (1+1A*)” (p* 468). 

As a theoretical background for these statements certain “artificial,” i.e., 
mathematicallj" defined, collectives (sequences) are considered with respect 
to their “spectra.” Examples are: (o) the sequence of natural numbers 
{n} (where n = 1,2, • • the sequence of the squares of natural numbers 
{n®} (a = 1, 2, • • • )j of the cubes, etc.; (c) the sequence a» (where a is fixed 
and (n = l, 2, • • •); (d) the sequence of all rational numbers in the well 

12 3 n 

known order —,--r, • • •, “ =1? 2, • • • ) (fractions with a com- 

n 71 —1 w —2 1 

mon divisor of numerator and denominator are discarded). For any such 
sequence the relative frequency of the indicant i within the first n elements 
is computed, and the limit of this relative frequency as w is considered. 
It is easily seen that for the first two examples such a limit does not exist. 
(Such “collectives” are termed “erratic.”) On the other hand it is shown that 
for (d), the limit distribution of the indicants is the harmonic distribution; 

i.e., the relative frequency of the indicant i equals log 

It is not the reviewer’s buaness to express an opinion as to whether such 
facts are of mathematical interest from the point of view of pure mathe¬ 
matics. In the re^dewer’s opinion the main problem consists obviously in 
constructing a connection between them and the harmonic property of 
natural collectives, a connection which would allow consideration of a cer¬ 
tain theoretical scheme as a “model” of a natural collective. Such a 
“model” is, in fact, indicated, namely the “artificial collective” (c): {a*} 
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(n = l, 2, • • • )j which shows “approximately” the harmonic spectrum (p. 
303). “The exponential function which expresses the law of natural growth 
forms a scheme of the many small causes which act by way of multiplication, 
being unobservable as long as they are isolated but becoming manifest by 
means of the spectra of natural collectives” (p. 503, translated and con¬ 
densed). “In this way the natural collective becomes an instrument which 
reveals certain properties of the social phenomenon which could not have 
been found in any other way” (p. 502). Thus, on the basis of the interpreta¬ 
tion of natural collectives by means of the sequence {a*} a theory of the 
natural collectives is somehow attempted. This theory is, to say the least, 
extremely vague. 

It may well be that a mathematician is not in the position to do justice 
to a work belonging to the field of social sciences. With this reservation a few 
comments may be allowed. It is hard to decide whether it is possible to 
give precise definitions to the main concepts introduced in the statement of 
the harmony law, as “natural collective,” “statistical number,” etc. There 
seems to be little point in making mathematically precise statements con¬ 
cerning incompletely and vaguely defined subjects. If such definitions are 
given the questions will remain: (a) whether this law expresses a theoretically 
consistent statement, and (6) whether in its application to reality it states 
a significant feature of reality, i.e. of the social reality behind the natural 
collectives. This seems to the reviewer rather improbable, certainly so far as 
mathematically precise statements are concerned. 

If we look in a purely empirical way at the examples in Furlan's book 
(e.g., p. 22) or if we examine, on our own, some statistics in yearbooks, it 
actually seemed to the reviewer as if there existed a certain vague “concen¬ 
tration on the one” as first figure. But here again some rather obvious re¬ 
marks force themselves upon us, e.g., that this concentration may be de¬ 
stroyed by a change of unit (“100 square miles” instead of “100 hectares,” or 
“per 5000 people” instead of “per 1000 people”), or that statistical numbers 
have upper and lower bounds (think of the possible numbers of children in a 
family, to quote a trivial example) which influence the indicants. If we con¬ 
sider this question of unit a little closer it seems that the “harmony law” 
ought to be independent against multiplication by a constant in order to 
be independent of the unit of measurement. This would imply the following 
theoretical statement: Consider an “artificial collective,” i.e., a sequence of 
numbers where the indicant, i, appears with a limit of relative frequency 
equal to log (i-f 1) —log i, as postulated by the harmony law. If in such a 
sequence each number is multiplied by one and the same constant the rela¬ 
tive frequency of the nine indicants should remain unaltered. This however 
is certainly not true. As a simple conterexample consider a sequence consisting 
only of the nine numbers, 1,2, • - -, 9 which are each written down so often 
that the number i occurs with the frequency (or more precisely with the limit 
of relative frequency) equal to log (1 -fl/i). Now form a new sequence by mul¬ 
tiplying each number in the original sequence by the constant 2. This new se- 
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quence will consist of the numbers 2, 4, 6, • • •, 18 where the “2” will appear 
with frequency log 2, the “4” with frequency log i, * • • the “18” with fre¬ 
quency log 10/9. In this new sequence the indicant “1” appears in 10,12,14, 
16, 18 with frequency equal to log 6/5+log 7/6+ • ■ ■ +log 10/9=log 
(6/5 • 7/6 • 8/7 • 9/8 • 10/9) = log 2, hence in fact again with frequency log 2. But, 
e.g., the indicants 3, and 5, and 7, and 9 will each have frequency zero instead 
of the frequencies postulated by the harmony law which thus appears to be 
completely destroyed by this operation.* Undoubtedly such obvious remarks 
must have occurred to the author. They apply, in this form, of course to arti¬ 
ficial, i.e., mathematical collectives oidy. A certain empirical regularity like 
the concentration on the one may be observable on “natural collectives” 
under certain specified conditions. However, an empirical observation, even 
if correct and fairly general hardly constitutes a comdncing basis for a far 
reaching general mathematical statement. And the theoretical foundation by 
means of a mathematical model (see our remarks concerning {a*} can hardly 
be considered at this stage as much more than an apergu. 

An impeding feature of the book is the lack of an index. This is particu¬ 
larly disagreeable since so many new and unusual words and notations are 
used. Otherwise the work is certainly extremel 3 ^ well written and very^ original. 
It remains to be seen whether this whole problem of the relative frequency of 
the indicants of statistical numbers reveals—by means of a suitable inter¬ 
pretation—^aspects of social phenomena which are of interest to social scien¬ 
tists. 

Fundamentals of Statistics. Truman Lee Kelley (Professor of Education, Har¬ 
vard University, Cambridge, Mass.). Cambridge 38, Mass.: Harvard University 
Press (38 Quincy St.), 1947. Pp. xvi, 755. Lithotyped. $10.00. (London E.C. 4: 
Oxford University Press [Amen House, Warwick Square), 1948. 55s.) 

Reiteew bt Ctbil Burt 
Professor of Psychology, University College 
University of London, London, England 

T his book is more than a revised and expanded edition of Kelley’s well- 
known Statistical Method (Macmillan Go., 1923). But it preserves many 
of the admirable features of that excellent work. In his approach, as the 
author pointed out at the time, his “greatest inspiration” had come from 
Karl Pearson; and he found “the English School had been most contribu- 
tive.” The present volume shows how Pearson’s general conception of statis¬ 
tics and the numerous contributions that he made on fundamental problems 

* It is obvious that the same eonterexample holds if instead of the losarithmic frequencies, postu¬ 
lated by the harmony law, any other nine frequencies, pt,. . . pt with sum equal to one, are assumed. 
It seems to the reviewer that from a purely mathematical point of view, it is, the concept of the *^di- 
cant” which is open to objections rather than the particular frequency distribution consldeied by the 
author. 
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may still serve as the foundation on which the newer additions of more recent 
years may be superposed. 

The chief alteration in the new volume lies in the greater attention paid to 
the logical principles underlying statistical reasoning. Statistical procedures 
are presented, not as a “bag of useful tricks and dodges,” but as a mode of 
reasoning about complex problems. This point of view is developed clearly 
and convincingly in the opening chapter. 

The chapters that follow, like those of the earlier book, deal in order with 
statistical series and statistical tables, graphic methods in two and three 
dimensions, and the “stable features of phenomena” (a topic that was only 
briefly touched upon in the earlier volume). This leads to the methods of 
measuring central tendency and variability, and the properties of the normal 
distribution. A fresh chapter is then inserted on the statistics of attributes; 
the use of chi-square and the variance ratio is introduced in this connection. 
We then proceed, much as before, to the problems of correlation, regression, 
and multiple correlation. The latter, it may be remembered, Kelley dealt 
with on Pearsonian lines in terms of determinants. It is therefore natural for 
him now to introduce the uses of matrix methods in this connection. 

The original chapter on “sundry statistical problems and procedures” is 
considerably enlarged. Some of the newer topics are those of periodicity, 
lead and lag in time series, direct and inverse interpolation, the variance 
errors of different statistics, and (another welcome addition) sequential 
analysis. 

The last chapter is entitled “mathematics: the mentor of statistical in¬ 
genuity.” Its main purpose is to provide, in compact form, a mathematical 
background for the experimentalist, so that he can equip himself with a 
knowledge of the particular functions that are most likely to be suggestive 
of phenomenological relations. Accordingly, in addition to a discussion of the 
point binomial, the Poisson distribution, and the hypergeometric series, we 
have sections on the Fourier series, factorials, growth curves, matrix algebra, 
the determinantal solution of simultaneous equations, and other topics. The 
book concludes with a useful set of statistical tables, a mathematical back¬ 
ground test, and lists of symbols and common mathematical terms (with 
definitions), and a key to formulae. 

It will be seen that, while the book includes nearly all the stock topics, it 
also contains much useful matter that is not to be found in the usual type of 
statistical textbook. As the author points out, statistics has developed so 
rapidly during the last twenty-five years that it would be impossible for 
anything short of an encyclopaedia to cover the entire ground. It is there¬ 
fore easy for the critic to point out omissions. The psychologist will probably 
regret that more attention has not been given to the analysis of variance and 
to factor analysis. In the discussion of the logical basis it is perhaps disap¬ 
pointing that more space has not been given to the theory of probability and 
its relevance to tests of significance, and particularly to the latest develop¬ 
ments in the theory of estimation. 
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Review by Horace W. Norton 
U. S, Atomic Energy Commi^ion* 

Oak RidgCf Tennessee 

F undamentals oJ Staiistics is an elementary textbook written by a well- 
known psycholo^st. Although the early chapters usually emploj' little 
mathematics, the later chapters use a fair amount of algebra (especially in 
treating determinants and matrices), and there is an occasional appeal to 
the calculus. The book is addressed to no specified class of readers, but it is 
fair to say that in flavor, language, examples, references, and point of view, 
it is definitely of the social sciences. Its object is clearly stated in the follow¬ 
ing quotation: 

The usual elementary text in statistics seeks to acquaint students with 
the techniques necessary to obtain. .. averages, measures of variability 
[etc.]. . . . This text considers these things as a secondary aim, the primary 
aim being to acquaint the student with the logic back of these procedures. 

It is believed to be more important... to understand why a mean is em¬ 
ployed than to know the steps for calculating it. . • (p. 57). 

As a text, it should be regarded as of more nearly 600 pages than 750, roughly 
200 pages being given to miscellaneous aids, such as mathematics, references, 
tables, and indexes. There are a few problems, and much emphasis on vo¬ 
cabulary. 

The first two chapters are introductory, exhibiting to the reader the mean¬ 
ing, the scope, and the uses of statistics. Thirty pages are given to a discus¬ 
sion of the mathematical background needed for statistical reading, quoting 
the Social Science Research Council report (1932) and including brief ex¬ 
planation or comment on a number of elementary mathematical topics. This 
is unusual by itself, and it is therefore especially noteworthy that there is 
also a mathematical background test with instructions and scoring key, en¬ 
abling the student to assess objectively his readiness to read this book. The 
next nine chapters (275 pages) are conventional in subject matter, treating 
statistical tables, graphics, the notion of stable features of variable phe¬ 
nomena, measures of variability and of central tendency, the normal distribu¬ 
tion, attributes, regression and correlation, and multiple regression. Chapter 
13 is a miscellany, including time series and periodicity, methods of arith¬ 
metic such as machine extraction of roots, and sequential analysis. Chapter 
14 is chiefly mathematics, but some statistical topics occur, such as moments 
of the binomial and Poisson distributions, and some transformations. The 
fifteenth and last chapter gives a few mathematical and statistical tables 
and a considerable list of references to such tables. Appendix A gives the 
mathematical background test already mentioned. Appendix B fists symbols 
and terms, formulas given in the text, and references. 

Because the author has not designated a specific group of readers as his 


* Hus review siiould not be construed as representing the ofSicial position of the U. S. Atomic 
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target, critical evaluation is not easy. Moments are emphasized, and there 
is a lengthy discussion of the correlation coefficient, its deficiencies, correc¬ 
tions, interpretation, and uses; and of matrices and determinants; solution of 
linear equations; and partial and multiple regression.The analysis of variance 
is mentioned a number of times but without systematic treatment. There is 
nothing on independent comparisons or single degrees of freedom, no mention 
of components of variance, no analysis of variance tables, and no worked 
example. Analysis of covariance is not mentioned, and experimental design 
receives only a half page. All this may be accepted by students and teachers 
in the social sciences, though its adequacy for their needs is surely doubtful. 
However this may be, it certainly will be unacceptable to most, perhaps all, 
other beginning students of statistics, though doubtless many will realize 
this only when they have ceased to be beginners. 

The lack of systematic treatment of the analysis of variance and covari¬ 
ance is the more difficult to understand because it is so widely regarded 
not only as fundamentally important to applied statistics but also as so help¬ 
ful to the author’s stated primary objective of emphasizing the lo^c and 
understanding of statistics. Likewise, systematic treatment of experimental 
design would have been helpful to the same end, revealing as it does an en¬ 
tirely different aspect of the statistical whole, even if it were kept to only a 
few pages. Nevertheless, the author pursues his objective with some success, 
chiefly in the early chapters. He has done well, for example, in formulating 
some of the quasi-axioms of statistical thought, which can be so helpful to 
the tyro who remembers them, such as: 

The reader should develop the habit of appreciating that an answer in 
terms of probability exists for every observed difference between two statistical 
measures, even though he does not have the necessary ability, or perhaps 
data, to arrive at a precise statement of it. Just as the error in statistical 
statements is ubiquitous, so should be the awareness by the student of this 
fact (p. 78). 

On the other hand, it should not be supposed that misleading and errone¬ 
ous statements have been entirely avoided, as the following examples show. 
It is asserted (p. 73, using hypothetical temperature forecasts!) that the error 
is independent of the prediction “as it universally must be.” There may be a 
deflnition of “error” which makes this statement true, but it need not be and 
usually will not be for real data analogous to the example. After a test for 
normality resulting in rejection of the hypothesis, we read, “knowing that 
successive daily maximum temperatures at a single spot are correlated and 
not independent, we have a ready explanation in that our items are not in¬ 
dependent random samplings” (pp. 304-5), but of course this has nothing 
to do with the normality of distribution of either the sample or the parent 
population. A graph of the chi-square distribution for one degree of freedom 
(p. 286) shows an inflection where chi-square is about 0.13. Of t it is said that 
“It is the distribution of [means, differences of means, and of regression 
coefficients] .. . computed from small samples drawn from a parent normal 
population” (p. 286), obscuring the nature of this revolutionary statistic. 
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We are told (p. 226) that the expected value of the standard deviation is the 
square root of the expected value of the variance. There is the old error of 
“explaining” kurtosis simply in terms of the relative frequencies near the 
mean and in the tails (p. 242). 

In addition to faulty statements, there are errors in formulae and tables, 
many of which are probably typographical, though none the less conse¬ 
quential to the student. For example, formulae for Fisher’s fourth h statistic 
appear on pages 215 and 247, neither being correct, and a few errors were 
noted in the table (pp. 640-52) of normal probability functions. 

The list of tables of statistical functions (including the usual mathematical 
tables) is excellent, each table being described as to content. It was a pleasure 
to the reviewer to find himself correctly credited for the table of the 20 per 
cent level of the z and variance ratio distributions. However, this pleasure 
was immediately mixed by inclusion of the 0.1 per cent table which of course 
is due to C. G. Colcord and L. S. Doming. The list of references, to both 
books and papers, would be much increased in value by elimination of such 
entries as Peters and Van Voorhis, Statistical Procedures and Their Mather- 
matical Bases or, better still, by furnishing critical comments for each entry. 

The notation seems unnecessarily cumbersome, and the terminology is 
sometimes unfamiliar, not to say strange. For example, there is frequent 
use of a symbol with both a bar and a tilde, the value found in a table is called 
the “consequent,” as the value used to enter the table is called an “argu¬ 
ment,” and the reviewer is still wondering what the “xlnt” correlation chart 
is. This may be the appropriate place to remark that both “random” and 
“bias” are often used but never defined. 

The author indulges in occasional fiights of fancy as to subject matter or 
phrasing which serve to enliven the text, as when, in discussing significant 
figures, he considers a palsied man using a meter stick to measure a live 
angleworm. 

The typography leaves much to be desired, both in appearance and accu¬ 
racy, the spacing of lines, words, and letters being irregular, and tj^iographi- 
cal errors being frequent, especially in names. The index suffers from numer¬ 
ous errors of omission and commission, and additional cross-referencing 
would be helpful. 


Mathematics of Statistics, Part One, Second Edition. John F, Kenney (Assistant 
Professor of Mathematics, University of Wisconsin at Milwaukee). New York 3: 
D. Van Nostrand Company, Inc. (250 Fourth Avenue), 1947. Pp. xii, 260. $3.75. 
(London W.C. 2: Macmillan & Co. Ltd. [10 St. Martin’s St., Leicester Square]. 
21s.) Two remews follow: 

Review by Howard Levene 
Lecturer in Mathematical Statistics 
CoLumhia University^ New York, AT. F. 

T ms is a slight revision, with minor clarifications and additions, of the 
first (1939) edition. The book deals almost exclusively with descriptive 
statistics, and requires no mathematics beyond high school algebra. 
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The first hundred pages deal with the mechanics and graphical representa¬ 
tion of frequency distributions, the usual measures of central tendency and 
dispersion, and higher moments. These are followed by chapters on the nor¬ 
mal curve, curve fitting, and correlation. The use of tables of the normal 
curve for curve fitting and graduation by areas is discussed. There are sec¬ 
tions on fitting straight lines and second degree parabolas. An exponential 
trend is fitted by using the method of least squares on the logarithms, with 
a footnote warning that this is not the same as a least squares fit to the 
exponential itself. Two pages are devoted to fitting the Gompertz curve 
without an explanation of its use, while the logistic and Makeham’s curve 
are merely mentioned and references given. 

The summation notation is introduced early and used consistently 
throughout the text. This lends much greater clarity to the extensive dis¬ 
cussion and proofs of various moment relations than is usual in a text on this 
level. For example, for k sets of variates combined into one set, the author 
obtains the relation 

‘T* = — 2 + 0*i.- 

N 

where 

= -^22 (Xa - X)*, 

N 1 

and states that this will prove important in the analysis of variance. 

This book is clear and well written and should be an excellent text for a 
course dealing with the material it covers. However, the reviewer questions 
the purposes served by such a course. The word "probability” appears for the 
fiffst time on page 114 and is occasionally used thereafter, but a discussion of 
the concept is deferred to Part II (not here reviewed), and the notion of 
sampling and sampling error appears only in vague form in some of the dis¬ 
cussions. The same letter is used throughout for both parameter and statistic. 
Sampling of attributes is not mentioned, presumably since a coherent dis¬ 
cussion of it would require probability theory. Because of these and other 
deficiencies the book is not suitable for a terminal course in statistics. Stu¬ 
dents intending to go on to a "practical” course in statistics do not require 
such a thorough algebraic foundation, while those intending to take further 
courses in mathematical statistics might do better to devote their time to 
pure mathematics, so as to be prepared for an introductory statistics course 
using the calculus and probability theory. With the difficulty of finding time 
in the curriculum for all the subjects a student should study, it is doubtful 
if time should be devoted to material which must be largely repeated, in 
somewhat different form, in later courses. 

Both the virtues and the defects of the book are well shown in the chapter 
on correlation. There is a good discussion of the nature of normal correla¬ 
tion. The normal bivariate surface is treated briefly, but the major mathe¬ 
matical treatment in this chapter is in terms of an existing finite sample. 
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Conclusioiis are drawn from this about the predictive value and the amount 
of relationship implied by a given correlation coeiEcient. While there is verbal 
reference to some vague “large*^ set of values or population, it would seem 
that this treatment would be confusing. Blakeman's criterion of linearity of 
regression is given, with a warning that it is not reliable and a reference to 
Part II for the analysis of variance test. The example on page 220 for which 
Blakeman’s criterion is said to break down is poorly chosen, since the analy¬ 
sis of variance test is not applicable. Blakeman’s criterion is the only sig¬ 
nificance test mentioned in the present volume, there being no mention that 
a criterion for existence of regression would be desirable. 

There are a few minor errors, misprints and passages to which exception 
could be taken, but there is only one that the re^wer feels he should men¬ 
tion. On pages 147 ff. it is recommended that when two variables have a 
linear relationship and both are subject to errors, the sum of squares of per¬ 
pendicular deviations from the fitted line should be minimized. This method 
is not invariant under change of scale of measurement. The problem of the 
determination of structural relationships is an important one, but requires 
a much more complete specification of the underlying chance mechanism, 
and even then cannot alwaj’s be solved. 


Ee’vtbw by Ronald W. Shephard 
Assistant Professor of Mathematics and Economics 
New York Universitij, New York, N. F. 


G enerally speaking this rension is not a substantial alteration of the 
pre\uous edition—an elementary textbook on descriptive statistics, 
much used and a pedagogically sound treatment. The topics covered, their 
order and method of presentation, and the illustrative material are identical. 
The altered format of the revision results in a change of page numbers for 
corresponding textual material, which is a nuisance for those wishing to use 
it along with the earlier edition, but it has produced a better integration of 
figures and tables with related discussion. Still, the paragraph numbers are 
identical and one may avoid the changed page numbering. 

It is difficult to find any changes in the textual discussion, and most stu¬ 
dents would use the revision along with the earlier edition without recogniz¬ 
ing any differences. A new paragraph on variables and constants is added at 
the befflnning of Chapter 1, and a not very useful paragraph on probability 
in Chapter 6. In the chapter on curve fitting a new paragraph is added deal¬ 
ing with logarithmic coordinate paper, and immediately preceding this 
Kenney has introduced a useful discussion of graphical determination of an 
exponential function with two new illustrative figures which change the en¬ 
suing numbering. Otherwise the alterations of test consist in a sentence here 
and there. The tables in the revision and earlier edition are the same. 

One would naturally expect substantial change and improvement of 
student exercises, but this has not been provided. The alterations consist 
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only in the addition of a few new exercises and constitute a disturbing de¬ 
ficiency of the revision for those looking for a new edition of this book. 

More fundamentally, this revision is an example of a practice of publishers 
to rush into new editions at this time without having ready a thoroughgoing 
rewrite of their popular textbooks, resulting in little or no incorporation of 
new treatments and new applications of methods which hav^e occurred during 
the war years. This is not to be taken as a criticism of Mr, Kenney’s general 
method of treatment, which the reviewer has found to be a relief from the 
usual elementary textbook on descriptive statistics in the social sciences. 


Descriptive and Sampling Statistics. John Gray Peatman (Associate Dean and 
Associate Professor of Psychology, City College of New York, New York, N. Y.) 
New York 16: Harper & Brothers (49 East 33rd St.), 1947. Pp. xviii, 577. S4.50. 

Review by Leon Festinger 
Research Center for Group Dynamics 
Massachuseits Institvie of Technology, Cambridge, Massachusetts 

I N THE past two years there has been somewhat of a spurt in the publication 
of books on statistics, the reason for which is not too evident. Certainly 
man 3 ' of the books on statistics which were published before the war had 
their little inadequacies, but the books published since the war seem on the 
whole to possess the same inadequacies or else to substitute deficiencies of 
their own for those of their predecessors. 

The motivation for the publication of these new books does not seem to 
be the presentation of new material either. While the war period did produce 
a number of new developments in statistical methods, these are rarely in¬ 
cluded in the new books which appear. Perhaps the explanation lies in the 
differences of opinion, which no doubt exist, as to what to emphasize and how 
to teach statistics. The implication is certainly present, however, that a new 
book on statistics should be evaluated not only in terms of what it includes 
and how it jMresents its material but also in terms of the jjossible justification 
for using this new book rather than one which abeady happens to repose on 
the library shelf. 

The title of the book under consideration. Descriptive and Sampling Star 
tistics, correctly implies that the book treats separately the use of statistics 
for purely descriptive purposes—^to summarize data—and the use of sta¬ 
tistics as a tool for testing hypotheses concerning population parameters. 
These two sections of the book are of very different quality and must be 
evaluated separately. 

The first part of the book, entitled “Descriptive Statistics” is, with respect 
to the content it covers, rather comprehensive. There is the customary array 
of frequency distributions, graphical representations, percentiles, means, 
medians, measures of deviation, and the like. There are some differences, 
however, between this and other similar texts. For one thing the discrepan- 
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cies in terminology among different writers on statistics have led the present 
author to seek to change this terminology at various places. We thus find 
terms like “non-variable data” defined to mean what is usually meant by dis¬ 
crete data which does not necessarily form a scale, and we find the invention 
of new words such as vi^ntile, tercile deviation, and quintiserial correlation. 
Another difference which we find in this book is the somewhat heavier 
emphasis on the use of deciles and percentiles (which the author prefers to 
call centiles) than one usually finds. With this greater emphasis the author 
is able to point out quite clearly how the median is simply a particularly 
important centile value, and how the quartile deviation is one of a class of 
such deviational measures which might be computed. 

In the preface to the book the author explains that he has “attempted a 
balanced presentation that will teach the student not only how to compute 
a statistical measure but when to use a particular technique and how to 
interpret a result.” The degree to which this balance has been achieved, how¬ 
ever, is open to some question. There is no doubt that a student who has 
carefully digested the material in the book will be able to compute the vari¬ 
ous measures which are presented. There are detailed steps for computation 
and numbers of illustrations of such computation. There is some doubt, 
however, whether this student will have an adequate enough understanding 
of what he is doing and why to insure the use of the proper measure where 
alternatives are available, and to insure adequate interpretation. In most 
places the book goes quite thoroughly into what to do but skims very lightly 
over the “why.” For example, the reader of the book is told to correct a phi 
coefficient between two dichotomized variables by dividing by ,637, but no¬ 
where is there any mention of the reason for such a correction, the assump¬ 
tions underlying this correction, or the conditions under which the correction 
should or should not be used. 

The second section of the book, entitled “Sampling and Analytical Statis¬ 
tics,” is of quite different character from similar sections in other textbooks. 
Some of these differences are commendable while others are rather unfor¬ 
tunate. 

The present book is one of the very few which have attempted to include 
a discussion of sampling problems in public opinion research. The discussion 
is relatively brief, occupying only fourteen pages, but nevertheless does 
present the major techniques such as stratification, area sampling, and quota 
sampling. There is perhaps not sufficient critical evaluation of these tech¬ 
niques and not sufficient discussion of the problems of each. There is also, 
unfortunately, no presentation of sampling error formulae for dealing with 
these various kinds of samples. 

Another feature of the book is a short but relatively good discussion of 
discrete probability distributions. This material, generally omitted from 
elementary statistics texts, includes important and useful statistical tools. 
This section, in the book under consideration, covers the theorems dealing 
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mth the addition and multiplication of probabilities and discusses the bi¬ 
nomial expansion in detail. 

Unfortunately, the remainder of the second part of the book is of rather 
inferior quality. The presentation consists almost exclusively of tests which 
might be justifiable when the number of cases in the sample is sufficiently 
large so that anything and everything distributes itself normally. The only 
exception made is for correlation coefficients where Fisher’s s transformation 
is given. The t test is dismissed in one page with the idea that it is not very 
useful in psychology and the social sciences; the F ratio for significance of 
difference between variances is never mentioned; the techniques of analysis 
of variance are entirely omitted and the discussion of the chi-square test is 
rather poor. It may be that this selection of material was determined by the 
idea that this was to be an elementary text, but the reviewer finds difficulty 
understanding why outdated and inferior procedures are more elementary 
than newer and more adequate procedures. 

It would undoubtedly be extremely difficult to write a textbook on sta¬ 
tistics which was suitable for use in an elementary course without having 
some inaccuracies in the presentation. In the present textbook, however, 
there seem to be many inaccuracies which could be easily removed. We will 
list two of these doubtful statements so as to give the reader an idea of their 
nature. 

Thus, for a series of samples of less than 25 or 30 cases each, the form of 
the sampling distribution of any statistic derived therefrom will skew more 
and more from the normal probability curve, the smaller the size of the 
sample (p. 324). 

.. . chance errors are errors that are just as likely to occur as not to occur; 
hence they are equiprobable (p. 332). 

Whether or not readers will find Descriptive and Sampling Statistics a 
worthwhile text to use will undoubtedly depend largely upon their personal 
preferences as to what should be taught in an elementary statistics course 
and how it should be taught. 

Review by Fredebick F. Stephan 
Director of Study of Education and Professor of Statistics 
Princeton University, Princeton, N. J. 

H ow should a textbook be appraised—terms of its competition or in 
terms of the needs it should satisfy? Compared with other texts in cur¬ 
rent use, this book has several commendable features that will appeal to 
some teachers and students. For the most part, these features consist of 
interesting discussions of the meaning and applications of statistical methods 
in psychological testing and public opinion polls. Among them one may men¬ 
tion an interesting sketch of the historical background of statistics and 
several passages in the sections on sampling, test validity, and correlation 
that reveal considerable skUl and ingenuity in presentaMon. However, there 
is another standard by which a textbook may be judged, namely, its effec- 



338 


AMERICAN STATISTICAL ASSOCIATION 


tiveness in remedying the defects of other books and bringing the teaching 
of the introductory course abreast of recent developments in basic theory 
and method. On this standard, the book does not appear to be satisfactory. 
To too great an extent it seems to be a rewriting of the content of familiar 
textbooks, principally Peters and Van Voorhis, and parts of Croxton and 
Cowden, and Yule and Kendall, rather than a fresh and modern formulation 
of the introductory course based on current research applications and meth¬ 
odological advances. It even falls behind these books (published some ten 
years earlier) in many respects. For example, the reviewer could not find as 
much as a casual reference to the analj^sis of variance. 

Of course, there is plenty of room for disagreement about what is appro¬ 
priate to a first course, even one of a year’s duration, especially when it is 
addressed, as this is, to students with no college mathematics. Yet, if the 
author really expects to ^ve students “the ability to select the most relevant 
statistical technique and to make the necessary computations with a mini¬ 
mum of error” (p. xvii), he could well include a discussion of such topics as 
the efficiency of estimating functions and the power functions of statistical 
tests, at least to the extent of providing a clear statement of the problem to 
be solved in each case. Certainly these concepts are fundamental to an under¬ 
standing of the logic that underlies the application of statistical methods 
and they can be presented simply enough to students who can understand 
the great variety of correlation methods that are discussed in great detail. 

While the book is intended for research workers in the social and biological 
sciences, it neglects important parts of these fields and emphasizes psycholog¬ 
ical and educational testing and public opinion polling. It slights psycholog¬ 
ical experimentation, genetics, and other major regions of the application of 
statistical method to research in these fields. 

The book is divided almost equally between “descriptive” statistics and 
“sampling and analytical statistics.” As a consequence the normal distribu¬ 
tion and correlation are discussed extensively before sampling and probabil¬ 
ity. This separation is harmful, in the opinion of the reviewer, for it obscures 
the extent to which statistical theory has been unified. 

The topics that are covered include: categorical data, proportions, ratios, 
index numbers, percentages, correlation of categorical data, frequency 
distributions, centiles, mean, standard deviation, the normal curve, correla¬ 
tion, sampling, probability, tests of significance, chi square, prediction from 
correlations, evaluation of psychological tests, and cluster and factor analysis. 

The author took pains to avoid sacrificing accuracy in attempting to write 
clearly but there are a number of places at which statements are confused or 
incorrect when read literally. A few examples will be cited: 

An IBM card is illustrated and described as having ten positions in each 
column (p. 38). No mention is made of the x and 12 or v positions. 

“The arithmetic mean ... and the standard deviation ,., The method 
of their computation is essentially algebraic and is often described as the 
method of moments” (p. 150). 
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“The normal distribution ... is a unimodal distribution with the greatest 
number of frequencies at the mean” (p. 175). The discussion of the normal 
distribution is inadequate; ordinates are given as probabilities (p. 184). 

If a test of significance does not indicate that the difference is significantly 
greater than zero, “we can be confident that no real differences exist in the 
characteristics compared” (p. 401). 

There is a meager one-page bibliography. It does not include such books 
as Snedecor or Lindquist, which are needed to cover important fields that 
this book slights. The appearance of books like this emphasizes the impor¬ 
tance of more effective communication between statisticians in various fields 
and improvement in the teaching of statistics, two fimctions in which the 
American Statistical Association should be increasingly active. 


Fundamentals of Statistics. J. B, Scarborough (Professor of Mathematics 
United States Naval Academy, Annapolis, Md.), and R. TV. Wagner (Assistant 
Professor of Mathematics, Oberlin College, Oberlin, Ohio). Boston 17, Mass.; 
Ginn and Co. (Statler Building), 1948. Pp. vii, 145. $2.50. 

Review by Leo Aroiax 

Assistant Professor of Mathematics^ Hunter College 
New York 21, New York 

T he appearance of a new text on statistics is always a source of pleasant 
anticipation for the harassed teacher searching hopefully for the ideal 
text. The present text, intended for engineering and science students, 
found its birth in the need for a short course of twenty lessons at the United 
States Naval Academy. An elementary knowledge of calculus is assumed. 
The book is remarkably brief. Simplicity and clarity are stressed throughout. 
The eight chapter headings and appendices are representation of data, aver¬ 
ages, measures of dispersion, comparison of distributions, correlation, prob¬ 
ability functions, the normal curve and a generalization, sampling, one ap¬ 
pendix on probability, a second one on the tables of the normal curve and the 
second, third, and fourth derivatives of it, a third one listing nine texts on 
statistics recommended for collateral reading, and five pages of answers to 
problems. 

The originality in conception of the book must be highly praised. Undoubt¬ 
edly it will be extensively imitated. The mathematics carries the burden of 
the explanations. Very few words are wasted. The authors exhibit consider¬ 
able ingenuity in the development of the topics and are at their best in des¬ 
criptive statistics, excellent in their treatment of probability and sampling 
once their point of view is granted. They have not been afraid to drop many 
topics considered sacrosanct such as Sheppard's corrections. Alany problems 
illustrate the main points of the text. A nice balance is maintained between 
theory and numerical computation. Yet there is a greater nod to the past 
than to the future. 

Fundamentally the reviewer disagrees with the authors’ philosophy of 
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testing hypotheses. Some intuitive explanation of the modern theories of 
probability as developed by von Mises and Kolmogoroff would be desirable. 
In many places too little explanation is given. Only two-sided tests of hy¬ 
potheses are mentioned and needless to say nothing concerning Type II 
errors, the error of accepting a false hypothesis. For sample means, N large, 
this is a simple matter. More serious is the confusion of two entirely different 
problems: "Given a probability function, all parameters known including the 
population mean, what is the probability of obtaining a sample mean x 
greater than xo?” with the problem “A sample of N variates is drawn, N 
large, from an unknown population (possibly of known functional form), 
a statistic is calculated say the sample mean, what can be said of the popula¬ 
tion mean?” On page 106 the statement is made:" • • • Npop ± 

vw —1 

This is interpreted to mean that the probability is about 2/3 that the mean 
of the population is within the range Maamp—o'jf to ilf.amp. "(!) Actually 
after a sample is drawn the probability that the population mean should lie 
in such an interval is zero or one. What is needed here is the theory of confi¬ 
dence intervals, certainly a fundamental topic. 

Some important topics omitted are significant figures, the range, /-shaped 
curves, the chi-square test of goodness of fit, analysis of variance, control 
charts, and small sampling theory. Topics which might readily have been 
omitted are the quartile deviation, the coefficient of curvilinear correlation, 
and the coefficient of rank correlation. On page 24, <r»=acry should read 
O'* =* I a|<ry. The common error concerning ou, the measure of kurtosis, occurs 
on page 33. The attempt is made to justify their statements by use of a 
Gram-Charlier Type A of three terms including a^. As Cramer remarks in 

lOas* 

his recent text (p. 184, p. 230) an additional term—is required; 

6 ! 

consequently the authors' remark is not accurate even for a Gram-Charlier 
series in which the terms are grouped according to the same order of mag¬ 
nitude. Naturally in some particular cases a4>3 is associated with a high 
peak at the mean. The final discussion on the limitations of the coefficient 
of correlation is good. 

A word might be said about notation. The letter x is used to represent the 
original variable, a variable with mean zero, and a variable with mean zero 
and standard deviation 1. The mean is usually written as M whether refer¬ 
ring to a sample mean or a population mean, although occasionally a dis¬ 
tinction is made as on page 105. The same remark applies to the standard 
deviation and the measures of skewness and of kurtosis. 

It is a pity that in the section on collateral reading no mention is made of 
the recent books by H. Cramer, H. A. Freeman, P. Hoel, A. Wald, C. E. 
Weatherburn, and S. S. Wilks. 
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Practical Rules for Graphic Presentation of Business Statistics. L. Edwin 
Smart (Professor of Economics) and Sam Arnold (Assistant Professor in Busi¬ 
ness Research). (Ohio State University.) Columbus, Ohio: Bureau of Business 
Research, College of Commerce and Administration, Ohio State University, 1947. 
Pp. xii, 86. $2.00. 

Review by Rudolf Modley 
President^ Executive Research, Inc. 

350 Fifth Ave., New York 1, N. Y. 

T he authors of this small book have set themselves the task to write a sim¬ 
ple “How to Do” book on the graphic presentation of statistics for busi¬ 
nessmen and students. In going about their job they have set forth specific 
rules applying to all charts and rules applying only to some types of charts. 
Each text discussion is accompanied by one or more illustrations. The book 
deals with bar charts, pie charts, line charts, three dimensional charts, pic¬ 
torial charts, and maps. The last third of the book presents samples of busi¬ 
ness charts. In presenting text and charts the authors do not discuss why 
something should be done in a certain way. They rarely present alternative 
methods of presentation; they only give the rules and leave it at that. 

There can be little doubt that a simple book of this nature is badly needed 
today. The use of charts has been increasing rapidly and many business¬ 
men need quick and simple guidance on how to draw charts. Unfortunately, 
“Practical Rules” cannot be recommended as filling this need. There are 
several reasons why it does not. 

a) The proof of a chart is in its looks. The least we can expect from a book 
on graphic presentation is that the charts are well laid out, well designed, 
and fit harmoniously into the book. This is not the case in the book under 
review. Nearly all the charts look awkward; they lack proper balance within 
themselves and on the page. There is poor selection of size for lettering and 
poor weight selection for rulings and curves. The instructions on lettering 
sizes (p. 6) are insufficient, instructions on grid and curve weights (p. 34) 
limited to “light” and “heavy.” Unnecessary or incorrect rules, as for in¬ 
stance, “that the grid will be wider than it is long in somewhat the same 
proportion as 5 is to 3” (p. 7) or that pictorial symbols should be “mere out¬ 
lines with little detail” (p. 48) also make it more difficult to obtain clear and 
forceful charts. 

b) This leads us to the second shortcoming of the book. Instructions are 
usually given as if there were “generally accepted standards” of chartinak- 
ing. While there arc some such standards it is obvious to all of us who deal 
with charts that effective presentation calls for considerable flexibility. Most 
of us will make trial layouts before we draw up a chart. We do so because 
this permits us to make adjustments in the many different elements which go 
into a chart. We reorganize or eliminate some elements, we emphasize others, 
we adjust dimensions and sizes of lettering until we are satisfied that the 
chart is truly effective. Only then do we begin the final draft. If the authors 
had tried harder to make the draftsman understand what one can “get out” 
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of a chart by different treatments instead of giving a set of seemingly in¬ 
flexible rules, a more helpful book would have resulted. 

c) The authors state that they did not want to “give the pros and cons of 
various practices.” (Actually they have done so occasionally displaying a 
preference for 100 per cent component bar charts over pie charts and ab¬ 
solute component bar charts.) It seems to this reviewer that an objective list¬ 
ing of the advantages and disadvantages of certain types would be essential 
in a practical book for those who cannot be expected to judge all the ele¬ 
ments which go into making a chart correct and effective. Alternative ways 
of presenting the same set of data would also have been helpful. The wealth 
of samples included in Brinton's Graphic Presentation is a considerable help 
to the layman. It gives him a chance to select from different methods the one 
which flts his needs most clearly. 

d) This reviewer has spent about twenty-five years in pioneering the de¬ 
velopment of pictorial presentation of statistical data. The shortcomings of 
the book under review naturally become most clearly obvious to him at this 
point. Pictographic presentation is advantageous in a limited field. These 
limitations in regard to audience, accuracy, and cost should be stated if we 
are to give a prospective chartmaker practical help. If the chartmaker de¬ 
cides for pictographs he will have to follow a set of simple rules; the authors 
give one of these and leave out all others. Several ways are available to 
duplicate pictorial symbols so that charts can easily be prepared. They can 
be photostated, photo-offset, tjrpe-stamped, or rubber stamped. Symbol 
sheets are sold by some organizations. None of this is mentioned in the 
book. The stencil suggested in the book is not being used by any chartmaker 
known to this reviewer. The statement that “pictorial charts are more ef¬ 
fective if the classification is qualitative” is not correct. There is ample 
literature in existence on pictorial presentation to have avoided the textual 
and graphic shortcomings in this section. 

e) A short book of this type needs a bibliography. Many readers will want 
to know where to turn for more information. 

In 1926 the American Society of Mechanical Engineers formed a Sectional 
Committee on Standards for Graphic Presentation. In April 1936, the So¬ 
ciety published a Code of Preferred Practice for Graphic Presentation — Time- 
Series Charts. On the basis of comments and suggestions received, a revi¬ 
sion, Time Series Charts—A Maniidl of Design and Construdtion, was pub¬ 
lished in November 1938 with the approval of the American Standards 
Association. Nevertheless, the manual is not a compilation of rigid rules. The 
Committee believed that effective presentation calls for flexibility of treat¬ 
ment rather than for standardization, that each chart should be individually 
planned with due reference to the special characteristics of the data and 
the particular use to which the chart is put. Within these limitations there 
is now available a set of well-tested rules for the presentation of time-series 
charts. The authors could have performed a real service to students and 
businessmen if they had tried to popularize the more important elements 
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contained in the manual. At present this little volume, Time^Series Charts, 
is still the best guide for those who want to draw charts of the conventional 
type. 


Control Charts: An Introduction to Statistical Quality Control. Edward S. Smith 
(Professor of Mathematics, University of Cincinnati, Cincinnati, Ohio), New 
York 18: McGraw-Hill Book Co., Inc. (330 West 42nd St.), 1947. Pp. xi, 161. 
$3.00. (London, W.C. 2: McGraw-Hill Publishing Co. Ltd. [Aldwych House, 
Aldwych]. 15«.) Two reviews follow: 


Review by Joseph M. Juran 

Professor of Administralive Engineering and Chairman of the Department 
New York University, New York, N. Y. 


H ere is another of the numerous books now emerging to expound the 
use of statistical methods to control quality of product. 

There is a good discussion of the use of control charts for averages, ranges, 
and number or fraction of defectives. Visual aids in the form of charts or 
tables are well prepared. The exercises are well chosen. 

The claim that the book is “a test for the busy executive” is wishful think¬ 
ing. In this work, and for that matter, in others, there is little evidence that 
the authors have truly grasped the quality problems of the industrial execu¬ 
tive. 

Industry's problem has been and still is to make product to meet specifica¬ 
tions. The statistical methods, if they are to be grasped by industry, must be 
presented as a means to the solution of that problem. The emphasis must, in 
the first instance, be not on (a) how to achieve controlled processes; the em¬ 
phasis must rather be on (5) the fact that through a controlled process it is 
easier to make products to meet specifications. 

To a statistician, (&) seems only too obvious; ergo the executive should 
take (6) for granted, and thus only (a) remains to be learned. The precise 
opposite is true. The practical problem is to convince the executive of the 
truth of (6). Even after he is convinced of (b) he will ordinarily not be con¬ 
cerned with (a) ; instead he will delegate the learning of (a) to another. 

With the exception of the unrealistic first chapter Professor Smith’s book 
can well serve as a supplemental text for undergraduate engineers. 


Review by Mason E. Wescott 
Assistant Professor of Mathematics, Northwestern University 
Evartston, Illinois 

P ROFESSOR Smith has long been a successful teacher of college mathe¬ 
matics. During and since the war, he has participated as an instructor in 
numerous 8- and 10-day intensive training courses in statistical quality 
control for industrial people. He has conducted several successful in-plant 
training courses and has served as technical advisor for the installation and 
operation of statistical quality control programs. Drawing on this back- 
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ground of theoretical and practical experience, he has contributed Control 
Charts to the McGraw-Hill Industrial Organization and Management Series. 

It has been the author’s intention to write a text “for men and women 
who wish to acquire knowledge of the methods and charts of statistical qual¬ 
ity control, but who are too busy to have an interest in the mathematical 
aspects of the methods.” He believes that “it is more important than ever 
that everyone concerned with such production (production of goods of high 
quality) should be able to make and read control charts.” 

With the latter point of view, this reviewer is in complete agreement, 
but he does not feel that such understanding can be acquired without more 
exposure to things mathematical than the “busy executive” generally feels 
he can or wants to endure. As a matter of fact, the innocent reader who be¬ 
gins Professor Smith’s book expectantly hoping to be painlessly initiated into 
the mysteries of statistical quality control, soon finds himself face to face 
with formulas and exposition that are pretty likely to appear frighteningly 
mathematical to him. That this is so, should not be held against the author, 
because Professor Smith has very nearly succeeded in keeping to a mini¬ 
mum the formal aspects of his subject. Rather, it is convincing testimony 
to the fallacy of trying to undertake an expository discussion of a technical 
subject, be it even so simple as control charts, without sooner or later having 
to use symbols and language that to a great many nonprofessional people 
must seem decidedly mathematical. 

Chapter 1 is probably the best chapter in the book from the point of view 
of the audience to whom the book is addressed. Here the author introduces 
the definitions he proposes to follow for such concepts as “control,” “qual¬ 
ity,” and “satisfactory control of quality,” distinguishes well between in¬ 
spection for the purpose of controlling quality and inspection which merely 
sorts good product from bad, outlines pointedly the fundamental role of a 
control chart as a criterion for telling when to take action and when to leave 
well enough alone, and lists ten benefits to be expected from a properly 
administered control chart operation. One may very well take exception to 
parts of the definitions given, but on the whole, they are acceptable for the 
purposes intended. The statement that “A production process is said to he 
producing goods of satisfactory quality if 99.7 per cent of the items are satis- 
factory in quality” is, to this reviewer, the kind of a statement he would 
rather not see used with the emphasis it is made to carry here. 

The remaining chapters are, for the most part, well organized and clearly 
presented. One becomes conscious of a tendency to emphasize details rather 
than to paint the picture in broad outline. The latter treatment would have 
seemed more consistent with the objective of “selling” the subject to an 
audience which is not supposed to be interested in details. However, Profes¬ 
sor Smith is understandably more the teacher than the salesman in these 
chapters, and as such, he anticipates and tries to remove the mental hazards 
that experience has taught him block the student’s progress in learning how 
to make and use control charts. Certainly, if the reader will exercise the 
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patience and intellectual curiosity to study through these chapters, working 
the problems proposed as he goes, he will end up with at least the mechanical 
skill to make a control chart, and a somewhat better than bowing acquaint¬ 
ance with the operational meaning of such a chart. 

In these chapters a number of technical points appeared questionable to 
this reviewer, but most of them are not serious enough to warrant detailed 
discussion here. Three points, however, cannot be ignored. 

o) There is an irritating looseness with which the word "normal” is used. 
Such phrases as "normal, symmetrical distribution,” "normal average,” 
"normal and approximately symmetrical frequency distribution,” "normal 
process” occur repeatedly without the benefit of any consistently adhered to 
definition; in fact, without any "definition” at all! Unless the reader has un¬ 
usual insight or sophistication, he may very well end up with the happy de¬ 
lusion that everything will be just dandy if only it can be described as "nor¬ 
mal.” The word is simply overworked in the last four chapters without re¬ 
serving to it the distinction it rightfully possesses in statistics. 

h) It is the opinion of this reviewer that on pages 43-52, and elsewhere in 
subsequent discussions. Professor Smith overplays his hand in the matter of 
testing control against product specifications. In the first place, the practice 
of putting "danger limits” (threensigma limits for individuals) on a chart for 
averages is questionable, particularly if the chart is to be kept at the ma¬ 
chine. Exhibiting the to-be-expected spread for individuals on a chart for 
averages simply invites trouble, because the operator finds it hard to under¬ 
stand why he must keep the points within the narrower band set by control 
limits for averages when the "danger limits” are way out beyond this band. 

In the second place, the discussion manages to convey the impression (al¬ 
though the contrary is explicitly stated on page 52) that there will frequently 
be hair-splitting closeness between a "danger limit” and a blueprint specifi¬ 
cation wliicli has to be resolved before a controlled process can be adjudged 
in satisfactory control. The criterion described is not in error: it simply 
overemphasizes a relatively unimportant point. Time might better have 
been spent in showing how to estimate the per cent of production likely to be 
out of tolerance when control is not satisfactory, and in discussing more fully 
the problem of balancing the cost of sorting and/or rework against that of 
maldng process or specification changes that will yield control safely within 
acceptable blueprint tolerances once the optimum capabilities of the process 
have been determined. 

c) The discussion in Chapter 4 uses the word "lot” where the word "sam¬ 
ple” would ordinarily be preferable. The beginner reading this chapter must 
surely experience a sense of confusion when, after digesting 21 pages in which 
he has had the word "lot” drilled into him at every stage, he suddenly dis¬ 
covers on page 106 that a box full of beads is a "lot,” but a paddle full of 
beads is a "sample,” and that now it is the sample that is to be inspected 100 
per cent rather than the lot as he has previously been led to expect! Evi¬ 
dently the author intended this chapter to deal with attribute inspection for 
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process control rather than for acceptance sampling, although, of course, 
the two are not mutually exclusive. But if inspection for process control is 
intended, it would seem more consistent and realistic to characterize a 
“day’s production” (which the author says must be inspected 100 per cent!) 
as a “sample” from the cause S 3 rstem represented by the production stream 
than it is to describe these items as a “lot.” 

Furthermore, it would be entirely jjossible for the beginner to get the 
impression from this chapter that 100 per cent inspection (in the ordinary 
sense) is required to keep a fraction defective chart. Many plants exercise 
very satisfactory process control through the maintenance of fraction defec¬ 
tive charts based on samples that are a very long way from being what the 
average man has in mind when he finds himself reading about 100 per cent 
inspection of a lot. It is the sample that is inspected 100 per cent, and it would 
be wiser to say so, rather than to use language that can be so easily misin¬ 
terpreted. 

Table II appears to be potentially useful in simplifying the calculations 
required for control limits on p and np charts. Its use is amply illustrated in 
Chapter 4. 

The typography is good; tables, charts, and diagrams are well done and 
legible. The book is easy on the eyes. Except for a few missing punctuation 
marks, only two errors were found: on page vi, the name should be Paul 
Peach, and on page 142, the bar has been omitted over the R in the third line 
from the top. 

Control Charts could prove useful as a text for a short training course at an 
elementary level. As a textbook for untutored home-study or as a “quickie” 
for the casual reader who wants to see the forest but not the trees, it is the 
opinion of this reviewer that the book is likely to prove disappointing. 


Methods Statlstiqiies Modemes des Administrations Fid^rales anx £tat8-irnis. 
Pierre Thionet. Paris; Hermann & Co. (6 rue de la Sarbonne), 1946, Pp. 
117. 


Review by W. Edwards Deming 
Adviser in Sampling, Bureau of the Budget 
Washington 

T his book is a unique and definite contribution toward improved statistical 
practice, as well as a delight to read. The author exhibits unusual ability 
in theory and extraordinary clarity in presentation. He recognizes the need 
for proper field organization in conducting sample surveys, and appreciates 
the need for reliable figures in government administration. The study of this 
book has been a pleasing experience for the reviewer. It is interesting to see 
how we are viewed from across the Atlantic, and gratifying to find the essen¬ 
tial points of progress in the United States so clearly pointed out. Here, we 
are too close to give a coherent account of what has been happening. More- 
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over the reviewer feels a personal debt to Mr. Thionet for providing in one 
cover a clear exposition of much published research that was difficult in its 
original form. 

The book was perhaps intended mainly as a digest of the essential mathe¬ 
matical papers that have made impact on statistical practice in the United 
States. Actually, it is much more than that. It is a manual as valuable to the 
nonmathematical administrator as to the student of theory. The author 
pleads for the use of figures in government administration in place of abstract 
theorizing and political maneuvering. Now that sampling has been developed 
to its present stage of reliability, real figures instead of fiction are possible 
and there is little excuse for operating without them. Progress is not achieved 
by shackling the theoretical statistician, but by giving him a chance. The 
author is fearless: "Et cette Evolution de la m^thologie statistique et m6me 
de la statistique math^matique risque de rester mal connue en France oh les 
personnes peu nombreuses) qui connaissent et utilisent la m^thode 
statistique, ne s’int6ressent trop souvent qu*5l Tun de ses aspects: pratique, 
th4orique, math^matique, administratif, etc., alors que nous entrons dans 

un domaine oh ila s'interp^n^trent tons_En Prance, oil d'ailleurs le nom 

m6me de statisticien, jadis Equivalent de celui d’actuaire, a disparu de la 
hiErarchie administrative officielle, les conditions sont profondEment differ- 
entes ... Et Pensemble de la nation fait preuve d'une prEdilection marquEe 
en faveur des idEes abstraites de prEfErence aux rEalitiEs chifi[rEes, de sorte 
qu^un gouvernement ou une assemblEe se prEoccupera rarement de Favis 
des statisticians quand une dEcision doit Etre prise” (pp. 4-6). 

This book is destined to be widely used. Fortunately a large part of the 
world reads French. With the coming census of population and agriculture, 
which most countries have agreed to take in 1950-61, there is an appalling 
need for sampling. The most serious handicap is official deafness, but lack 
of trained personnel and suitable texts is almost as serious. No one book will 
meet all needs, but every good one will help. In many countries, proper 
sampling procedure offers the only hope of obtaining census information of 
sufficient reliability and speed to be useful. Even in this country, with a well- 
developed census, sampling is used not only for current information on em¬ 
ployment, housing, cost of living, characteristics of farms and farmers, and 
many other purposes, but was also used concurrently with the complete 
censuses of 1940 and 1945 (and will likewise be so used in 1950) with the 
aim of broadening the scope of the census and speeding up the returns. 

The student of theory will find in this book an ably written and unified 
account of the mathematical researches that have guided progress in statis¬ 
tical practice. It is impossible to portray either its brilliant selection of con¬ 
tents or its lucidity in a review, but a hasty summary would include a non¬ 
mathematical introduction to the general principles of sampling, its ad¬ 
vantages and disadvantages; descriptions of the U. S. Census tabulations 
and their use in sampling; appreciation of the fact that sampling is not only 
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an expedient of war-times but an instrument of stability in peace; some gen¬ 
eral theory and remarks on application. Then follow five chapters and two 
annexes, in which are beautifully expounded in uniform notation a number 
of methodological researches that have had impact on modern practice. 
Works of Hansen and Hurwitz, Neyman, the Madows, Cochran, Jessen, 
Stephan, Stock, Frankel, Geary, Hauser, Snedecor, King, H. Fairfield 
Smith, and Walter Hendricks are included. Unfortunately the author missed 
the profound researches of Mahalanobis, whose reports on sampling for the 
acreage of jute and other crops and conditions in Bengal have been recorded 
from time to time since 1937 if not earlier. Mahalanobis's magnificent article 
"On large-scale sample-surveys” {Philos. Trans. Royal Soc. 1944) probably 
reached Paris a bit late for inclusion: this article not only described Mahalan- 
obis^s brilliant researches in the use of grids in agricultural sampling, but 
also lays down the framework into which all sampling procedure must here¬ 
after be fitted. This framework consists of four steps: 

i. Decide in advance on the widest permissible sampling tolerance, and 
specify also what measurements are desired on the differences between 
various alternative questionnaires, definitions, methods of training, and 
between interviewers, respondents, etc. 

ii. Design the cheapest plan that will meet these requirements and stay 
within the limitations of personnel and physical equipment available and 
any administrative restrictions that must be imposed. Compute the ex¬ 
pected cost. 

Hi. Reconsider Steps i and ii in view of the expected costs and precision. 
Perhaps relax the prescribed precision or forego some of the aims to cut 
costs; or tighten up the precision if the cost is not too great. 

iv. After the results of the survey are in, compute the costs and the 
sampling errors and the differences between the various alternative pro¬ 
cedures that were specified. This appraisal of the results is independent of 
the assumptions that were made in the planning. It provides further knowl¬ 
edge of the proportions and variances in the universe, and of costs, so that 
a cheaper job can be done next time. 

It is to be noted that these steps, which constitute modern statistical prac¬ 
tice, were impossible and unthinkable until the theory of probability was 
introduced into the design of surveys and experiments. Thionet is clear on 
these points, although the steps as enumerated above are not stated ex¬ 
plicitly. 

The author himself has made a number of notable contributions in theory, 
not only by filling in gaps but by extension as well. By coincidence these 
extensions sometimes follow lines of unpublished recent researches that are 
even now profoundly modifying federal statistical practice. One such con¬ 
tribution is in the topic of estimationy in which the author does not shrink 
from the use of biased estimates. Given the simple case of Bernoulli sampling 
from each of L urns, three plans of estimating p are discussed from the stand¬ 
point of consistency, bias, and variance (pp. 28-30). If Xi is the number of 
black balls in a sample of n,* from the tth um, three possible estimates are 
these: 
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Then follows a discussion regarding comparisons of the three variances, with 
and without stratification, and under various systems of apportioning the 
sample amongst the strata. The author makes it clear that the aim in making 
an estimate is to come forth with a number whose reliability can be controlled 
and measured, whether it be biased or unbiased; that the procedure of esti¬ 
mation is part of the sampling plan; and that the cost of the sample and the 
precision of the results will depend on the procedure of estimation as well as 
on the method of selecting the households that are to be interviewed. 

A review is not complete without one criticism. The reviewer fears that 
the section-heading on page 13 "Pas d’4chantillonnage sans Taide des recen- 
sements” will lead the uninitiated to suppose (as so many of the laity now 
suppose) that a sample must rely on a complete census for the total number 
of people; that the sample can only give estimates of the proportions of 
people in various classes, and that, even then, it is well to have some census 
information as a check. Information from a previous census is indeed useful 
in designing a sample, as Thionet points out, but it is not necessary. Clever 
use of available census information for stratification, for ratio-estimates, and 
for drawing areas with probability in proportion to previous size, will be 
effective in decreasing the standard error of subsequent samples, and hence 
in cutting costs. But census information is not n 3 cessary; detailed maps can 
be used for defining areal units and are often so used even when census in¬ 
formation is available. Obsolete census informat on and obsolete directories, 
civil and parish lists, are useful for sampling bee luse, with proper techniques 
of estimation, the errors in the prior information cancel out, leaving the 
sample estimates unbiased. 

Mr. Thionet deserves the praise and thanks of all statisticians. The re¬ 
viewer closes with the hope that translations of the book into Spanish and 
Chinese will appear in time for assistance in planning the world census of 
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population and agriculture in 1950-51 in places where French is difficult to 
read. An edition in English would likely find high favor as a text and reference 
book not only here but in other English-speaking countries, notably in coun¬ 
tries like India where statistical teaching and practice have reached high 
levels. 

An Extension Cotixse in Statistical Quality Control. A. E. R, Westman (Director, 
Department of Chemistry, Ontario Hesearch Foundation, Toronto 5, Canada). 
Reprint of series of articles which appeared in Manufacturing and Industrial 
Engineering. Toronto 1, Canada: Consolidated Press Ltd., 1947. Fp. 58. Paper. 
$1.25. Two reviews follow: 

Review by A. C. Cohen, Jb. 

Associate Professor of Mathematics 
University of Georgia 
Athens, Georgia 

T his syllabus was originally prejMired for use in statistical quality control 
courses which Dr. Westman conducted during the war under sponsor¬ 
ship of the Canadian Government for the benefit of Canadian industry. The 
Canadian courses were closely patterned after those offered in this country 
by OPRD for American war industry, but as reflected by the syllabus, the 
Canadian courses were even more condensed than the intensive OPRD 
courses. 

Frequency distributions, control charts for measurements, control charts 
for counts, acceptance sampling with particular emphasis on the use of the 
Dodge-Romig tables, installation of a statistical quality control system, and 
other related topics are among the subjects discussed in the thirteen two- 
hour lessons which comprise the course. The various lessons include lectures 
to be presented by the instructor, demonstrations in which both the in¬ 
structor and class members participate, and laboratory periods during which 
class members execute assigned exercises under the instructor’s supervision. 
The numerous demonstrations which are described in detail effectively illus¬ 
trate the theory of probability as it concerns statistical quality control, and 
the many exercises afford the student practice in applying the statistical 
techniques of quality control to practical situations. 

This publication will unquestionably be a welcome addition to the quality 
control library of anyone concerned with the training of quality control per¬ 
sonnel. It is particularly recommended to anyone confronted with the task 
of setting up and conducting a course of instruction in statistical quality 
control at the elementary level. Unfortunately its usefulness is marred by a 
somewhat lengthy list of errata. Most of the errata are typographical, but 
several involve formulas which if not corrected will certainly prove confusing 
and annoying to the uninitiated reader. A more clearly stated distinction be¬ 
tween sampling for process control and sampling for acceptance (i.e. disposi¬ 
tion) of product would be desirable. In this connection, although Lesson 12 is 
concerned with acceptance sampling, that fact is not explicitly stated, and 
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the reader having no previous knowledge of the subject will likely find the 
omission confusing. The subjects of correlation and contingency tables 
which are included in Lesson 13 may prove too advanced for a first course in 
quality control for anyone who has not had previous training in statistics, but 
if desirable, these topics can be omitted without detriment to other material 
in the course. 


Review by Paul Peach 
Associate Professor, Institute of Statistics 
The University of North Carolina, Raleigh, N. C. 

D uring the war, in connection with the 8-day courses in statistical quality 
control organized by the U. S. Office of Education and the War Production 
Board, there was prepared a study manual by Professors Holbrook Working 
and Edwin G. Olds which more or less stabilized the pattern of instruction 
for those courses. That manual was never published, but several of its legit¬ 
imate children have attained the dignity of print. Dr. Westman’s pamphlet 
contains an introduction in which this parenthood is specifically ac¬ 
knowledged. 

The student begins by making some histograms and calculating a few 
means and standard deviations. Three following lectures deal with control 
charts for measurements, three more with the charts for p and c, and then 
three with acceptance sampling. The last three bear the titles "Installing a 
Quality Control System,” "Other Sampling Inspection Schemes,” and “In¬ 
troduction to Advanced Topics.” 

The presentation is, I think, an improvement over previous ones; the 
style is clear, and each lecture is illustrated by diagrams, tables of data, and 
similar aids. A laboratory or demonstration period is provided for at the 
close of each meeting. Most of the chapters conclude with a list of special 
terms, though without definitions. Taking the work as a whole, one might 
say that for the most part it follows the pattern which was created in the 
years 1942 to 1944, treating the customary subjects in the established way, 
with just about enough novelty to justify a separate existence. 

There may be some difference of opinion about these novelties. Dr. West- 
man includes less theoretical statistics than did the old WPB course, and I 
should be inclined to go even farther and eliminate theory altogether. The 
avowed viewpoint of the pamphlet is that statistical quality control is a 
branch of industrial engineering, and while I disagree with this idea, yet I 
agree entirely that a short course for engineers should skip the statistical 
theory. A noteworthy recent contribution to the literature is the new British 
Standards Institution publication Fracti<m-Defec1ive Charts for Quality 
Control (B.S. 1313:1947) which has been almost completely purged of statis¬ 
tics. 

Dr. Westman's final chapter treats control charts for composites, correla¬ 
tion, and the chi-square test. Presumably this chapter is intended as a sort 
of bonus, since it seems unlikely that any lecturer would attempt to cover 
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this much ground in a single period. Here again we may ask whether, when 
we can at best do so little, it would not be better to do nothing. Nevertheless, 
since the chapter comes at the end, the instructor can if he likes omit it, 
and no harm is done. 

This last chapter means, I suppose, that engineers realize that the usual 
introduction to control charts and acceptance schemes is only a beginning, 
and that the value of other statistical tools is becoming recognized. Unfor¬ 
tunately, most of the present literature on statistical quality control ends in 
a blind alley; the student who wishes to learn advanced methods has to 
retrace his steps and start all over with new definitions, new concepts, and a 
new notation. In his first twelve chapters Dr. Westman follows the current 
writings on control charts, and of course is in a difficulty in his thirteenth. 
He cannot escape introducing the “degrees of freedom^ concept, and gives 
formulas for the number of degrees of freedom appropriate to correlation 
coefficients and contingency tables; I am afraid that with this much intro¬ 
duction the concept will still seem pretty esoteric to his readers. There is 
room in our educational system for a manual that will do what Dr. Westman 
has done, but with a viewpoint and notation that will enable the student to 
study further if he likes without discarding most of his elementary instruc¬ 
tion. 

A word about typography may be in order. Most printed works, especially 
first printings, contain typographical errors; with these I am not concerned. 
Print shops not accustomed to mathematical text usually have trouble with 
it, especially with equations and other algebraic expressions which must be 
set by hand. Frequently, if special mathematical types are not available, it is 
necessary to use makeshifts. A lot can be done in this way if the author and 
the typesetter work closely together; but in the present work the handset 
portions are really a mess. Dr. Westman was also done dirt by in other ways; 
for instance, his publishers have not even bothered to use uniform style for 
chapter headings. The presswork and paper are good, but some of the edi¬ 
torial work is plain slovenly. 

Toleranser och Passninger. Stig Zetterhmd. Stockholm, Sweden: Sveriges Meka- 
niska Verkstaders Forbund, 1945. 

Review by Cecil C. Cbaig 
Professor of Mathematics, University of Michigan 
Ann Arhcr, Michigan 

O F THIS rather compendious book on tolerances and fits written by a 
member of the staff of the famous Swedish manufacturer of ball bearings, 
the SKF company, only chapter 8 is of general interest to a statistical audi¬ 
ence. To those who have devoted at least a share of their energy in recent 
years to advocating the use of statistical methods in the control of the quality 
of articles produced by repetitive processes, it is gratifying to see this much 
recognition given in a book like this to the importance of probability and 
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frequency laws, but it is also disappointing to see that the development here 
still stops well short of adequately conveying to those concerned with manu¬ 
facturing interchangeable parts how useful statistical tools can be to them. 

In tliis chapter the fact is brought out that the actual sizes of a part pro¬ 
duced to the same nominal dimension are subject to variations which, if 
due to chance only, tend to be distributed in a bell-shaped frequency curve, 
very often usefully summarized by the normal frequency law, hence by 
means of the average and standard deviation. It is well illustrated that the 
play between two assembled parts, such as a hole and an axle, when each is 
subject to random variation, even of a rather arbitrary sort, tends to be so 
distributed. The chief problem actually dealt with is that of the variability 
to be expected in the sum of the dimensions of two assembled parts, when 
the variability of each is known. 


Conference Papers: First Annual Convention, American Society for Quality 
Control and Second Midwest Quality Control Conference, Hotel Sherman, 
Chicago, Illinois, Thursday-Friday, June S and 6, 1947. Rochester 8, N. Y.: 
the Society (c/o Alfred L. Davis, Treas., Rochester Institute of Technology), 
1947. Pp. vi, 284. Paper, lithotyped. $3.50. For a complete list of the papers and 
contributors in this volume, see 1431-2, 1438-9, 1453-5, 1528, 1562, 1650, 1665, 
1691, 1712, 1714, 1718, 1720, 1761, 1767, 1770, 1776, 1781, 1791, 1847-8, 1853, 
and 1861 in “Statistical Methodology Index, No. 11” in the March 1948 issue of 
this Journal. Two reviews JoUow: 

Review by H. A. Freeman 

Associate Professor of Statisticsj Massachusetts Institute of Technology 
Cambridge, Massachusetts 

F ive papers, three of them by corporation presidents, deal with broad 
aspects of quality cootrol, such as the views of management and the 
problems of organization and administration of quality control programs. 
Seven pa])ers, chiefly by Professors Burr of Purdue and Wescott of North¬ 
western, describe, quite nonmathematically, the app.'iratus of elementary 
statistical (luality control. These include quality control charts for continuous 
and discroto varialdes, single and double sampling inspection for attributes, 
and the rudiments of linear correlation. Fifteen papers are devoted to de¬ 
scriptions of successful applications of elementary quality control to particu¬ 
lar problems. Some of these deal with subjects to which the methods of 
statistical quality control are not often applied, for example, personnel data, 
time study, stockroom inventories, waste and personnel efficiency. 

There remain two papers not classified above. One is a very good review 
of modern methods for variables inspection, including nonccntral t, by Pro¬ 
fessor Olds of Carnegie Institute of Technology; from a statistical point of 
view it is the most interesting paper of all. The second paper, by Professor 
Peach of North Carolina, discusses the teaching of statistics to engineering 
students. 
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These are formative years for organized quality control and this is reflected 
in the proceedings of the first national conference. As the field begins to 
establish itself, it relies almost exclusively on a few elementary, easy-to- 
understand methods, almost all of which stem from the pioneering work of 
Shewhart, Dodge and others of the Bell Telephone Laboratories. More 
complicated, though perhaps not more effective, methods will undoubtedly 
come later. Another feature of this volume is its occasional evangelism. There 
will probably be less of this at later conferences for it seems certain that 
quality control methods will ultimately be very generally employed through¬ 
out American industry. 

Finally, a word to would-be purchasers. This volume should interest 
anyone who is actually or potentially interested in industrial quality 
control. Administrators, plant personnel, teachers, and students will all find 
something in it for them. Its contents will not generally interest professional 
statisticians, except as they suggest problems which might be attacked 
by different methods. 

The book is produced by the photo offset process and it is paper bound. 

Review by Eugene L. Grant 
Professor of Economics of Engineering, Stanford University 
Stanford, California 

O N JUNE 5 and 6, 1947, the second Midwest Quality Control Conference 
was held in Chicago in combination with the first annual convention of 
the American Society for Quality Control. Some 38 papers were presented 
dealing with statistical quality control applications in many different in¬ 
dustries including radio and other electrical products, automobile parts, 
plastics, temperature controls, Scotch tape, pens and pencils, surgical dress¬ 
ings, beer, dairy products, veneer, cleansers, garments, chemicals, ammuni¬ 
tion, pharmaceuticals, paper, hybrid corn, and the retail mail-order business. 
Through the energy of the Proceedings Committee of the Midwest Quality 
Control Conference, under the chairmanship of Professor Mason E. Wcscott 
of Northwestern University, this volume containing twenty-nine of these 
papers has been made available to the quality control public. The volume, 
reflecting as it does current problems and current thinking in the quality 
control field, should be a useful addition to the library of any industrial 
statistician or any teacher of industrial statistics or statistical quality con¬ 
trol. 

A number of the papers are really expositions of principles of statistical 
method as applied to various industrial situations. These are at different 
levels of statistical sophistication. A group of five papers by Professor Wes- 
cott and Professor Irving W. Burr of Purdue University constitute a presen¬ 
tation of elementary control chart principles and the elements of acceptance 
sampling. Other papers, calling for greater familiarity with statistical meth¬ 
ods on the part of the reader, deal with such topics as the use of modified 
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control limits, acceptance sampling by variables, statistical techniques for 
research and development, and the fundamentals of correlation. 

Most of the papers, however, are descriptions of specific cases or discussions 
of technical statistical problems or managerial problems arising out of 
specific cases. The number of industries represented would seem to give an 
impressive answer to any industrial manager who gives the familiar response, 
“My business is different,” to the suggestion that he look into the applica¬ 
bility of statistical quality control techniques in his own plant. Nevertheless, 
it should not be concluded from this spread of applications that the time 
has yet been reached when statistical quality control methods are in wide¬ 
spread use in American industry. In many cases the uses described may be 
only in a single plant in a large industry. Some of the authors make it clear 
that they feel that the surface has only been scratched in their organizations. 
In other papers where this is not specifically stated it is evident that, at 
least in June 1947 when these papers were presented, the applications were 
in the beginning stage. 

Several of the papers deal with matters somewhat removed from strictly 
quality control problems. Two papers apply control charts to personnel 
statistics. Another applies statistical methods to the interpretation of time 
study data. A most interesting paper of this type is “Scientific Control of 
Stockroom Inventories” by E. H, Wilson, General Service Manager of Bell 
Telephone Laboratories; this applies probability mathematics to the deter¬ 
mination of ordering points and presents a fresh and stimulating discussion 
which should be of interest to industrial engineers and cost accountants as 
well as to statisticians. 

The final paper in the volume, “Sampling and Sorting Ammunition for the 
Attack on Normandy” by Col. Leslie E. Simon, should, in the opinion of 
this reviewer, become a classic in the quality control field. This is not only 
because of the spectacular nature of the subject matter but also because of 
the effective way in which Col. Simon includes good sound advice for quality 
control men on the nonstatistical matters associated with their jobs. 

It is to be hoped that similar volumes of conference papers will be made 
available from subsequent A.S.Q.O. conventions and subsequent Midwest 
Quality Control Conferences. 



STATISTICAL METHODOLOGY INDEX, NO. 12* 

A QUARTERLY GUIDE TO CURRENT LITERATURE 


Edited by 

Oscar Ebisen Bubos 

Rvigers University 


This bibliographic seniee presents statistical methodology literatuTS {aarUdes^ books, 
theses, and chapters) published inlOJ^B to date. Anonymous references are listed first; 
other references follow in alphabeticoil sequence by firsbnamed authors. Whenever 
known, the position and address of an author are given the first time his name appears 
as sole author or as the fird listed of two or more authors; otherwise, his position and 
address are usually repeated for each of his publications. Volume number, issue 
number (fn parentheses), pagination, and date of issue are given for each journal 
reference. Stars indicate books, theses, and pamphlets; asterisks indicate publican- 
Hons seen by the editor of this Indbz. Authors of papers on statistical methodology 
are invited to send reprints to Oscar K. Buros, Rutgers University, New Bruns- 
wick. New Jersey in order to facilitate prompt accurate listings. Foreign^ 
language papers thould be accompanied hy English translations of titles. Information 
is desired abottt references omitted. 


1877. Batcli House Assignable Causes 
In Density Control. Glass Ind 26(11) :512~ 
3+ N 

1878. Hew Statistical Method of Pre¬ 
dicting Sunspots Aids Radio Propagation 
Forecasts. J FranMin Inst 244(6) :481'<7 
D *47.* 

1879. Refined Methods of Control of 
Cordiness and Workability of Glass During 
Production: Statistical Control Charts Ap¬ 
plied to Daily Measurements of Density. 
Glasa Ind 26(2) :83 F ’45.* 

1880. ADAM, W. B. (Deputy Director 
and Senior Chemist, University of Bristol 
Research Station, Clampden, Gloa., Eng¬ 
land). The Use of Statistical Methods in 
Research on Food Canning: TheAppUca- 
tion of Statistical Methods to Food Prob¬ 
lems. Ancly^ 73(862) :7-ll Ja ’48.* Ab¬ 
stract Chem & Ind (3) :43-4 Ja 18 *47.* 

1881. ADLER, H. A. (Senior Engineer, 
Engineering Department, Line Dedgn Di¬ 
vision, Adams St., Chicago 90, IIL), and 
MILLER, K. W. (Assistant Director, 
Armour Research Foundation, Illinois Jjl- 
statute of Technology, Chicago 16, HL). 
A New Approach to Probability Problems in 
Electrical Engineeiing. Elec Eng Trans 
Sect 65(10) :630-2 O ’46.* 

1882. ANDREWS, T. GAYLORD 


(Scientific Advisor, Research and Develop¬ 
ment Group, Lo^btics Division, Genei^ 
Staff, Department of the Army, Washing¬ 
ton 25, D. C.; on leave: Assistant Professor 
of Psychology, University of Chicago, Chi¬ 
cago 37, Ill.). Statistical Studies in Allergy; 
II, A Factorial Analysis. J Allergy 19(1) :43- 
6Ja'48.* 

1883. ANTHONY, AUDREE D. 
(Chemist), and SOUTHER, HOBART 
(Chief Chemist). (Cone Mills, Greensboro, 
N. 0.) Quality-Control Charts Aid Carding 
and Spinnmg. Textile World 98(3): 111-8 
Mr ’48.* 

1884. ASHBY, ERIC (George Harrison 
Professor of Botany and Director of the 
Botanical Laboratories and Experimental 
Grounds, University of Manchester, Man¬ 
chester, England). Statistical Ecology: II, A 
Reassessment. Bot B 14(4):222-84 Ap ’48.* 

1885. ATHERTON. A. L. (Supervisor 
of Quality Control, Westinghouse Electric 
Corp., East Pittsburgh, Pa.) Lot hy Lot 
Inspection. Mat dt Msth 22(6): 1792 D 
'45.* 

1886. BAILEY, W. G. (Assistant Actu¬ 
ary of the Eagle Star Insurance Co., Ltd., 
38 Moorgate, London E.C.2, England), 
and HAYCOCKS, H. W. (Secretary, Actu¬ 
arial Tuition Service, 6 Staple Inn, Holbom^ 


* Numerous references prepared for this issue of the Izroscx have been omitted because of apace 
limitations. 


356 






STATISTICAL METHODOLOGY INDEX 


367 


liondon W.0.1, England). A Synthesis of 
Methods of Deriving Measures of Decre¬ 
ment Erom Observed Data. J Inst Actu¬ 
aries 73(337): 179-99, discussion 200-12 D 
'47.* 

1887. BARTON, SAMUEL G. (Presi. 
dent. Industrial Surveys Co., Ino., 347 
Madison Ave., New York, N. Y.). Con¬ 
sumer Panels, pp. 155-63, dkcussion 170-6. 
In Measurement of Consumer Interest (jsee 
1903). 

1888. BEALL, GEOFFREY (Research 
Associate, Institute of Paper Chemistry, 
Appleton, Wis.). Quantitative Treatment of 
Density Distribution Pattern on Photo¬ 
graphic Paper as Produced by the Dis¬ 
turbance of a Spotlight Galvanometer. 
Paper Trade J 124(19) :35-7 My 8 '47.* 
Also in Paper Ind 28(12): 1801-2 Mr *47.* 

1889. BEARD, R. E. (Assistant Fire 
Manager, Pearl Assurance Co., Ltd., High 
Holbom, London W.C.l., England). Some 
Notes on Approximate Product-Integration. 
J Inst Actuaries 73(337): 356-403, dis¬ 
cussion 404-16 D'47.* 

1890. BENNETT, B. M. (Research 
Assistant, Statistical Laboratory), and 
NEYMAN, J. (Professor of Mathematics 
and Director of the Statistical Laboratory). 
(XTniversity of California, Berkeley, Calif.) 
The Performance Characteristic of Certain 
Methods for Obtaining Confidence Inter¬ 
vals. Abstract. Ann Malh Stat 19(1): 116 
Mr '48.* 

1891. ^BICEERSTAFF, THOMAS 
ALTON (Professor of Mathematics and 
Head of the Department,tJniversity of Mis¬ 
sissippi, University, Miss.). Certain Order 
Probabilities in Nonparametric Sampling. 
Doctor's thesis, University of Michigan 
(Ann Arbor, Mich.), 1948, 

1892. PICKING, CHARLES A. (Qual¬ 
ify Control Engineer, Hercules Powder Co., 
Wilmington 99, Del.). Industrial Statistics: 
Their Place in the Manufacture of Pulp 
and Paper. Pvlp <fc Paper 49(3): 181-8-f* 
^ 48 .* 

1893. BICKING, C. A. QuaUty Control 
for the Pulp and Paper Industry. Teck 
Ajssn Papers 30:135-9 Je '47.* 

1894. BLACKWELL, DAVID (Piofes- 
sor of Mathematics, Howard University, 
Washington 1, D. C.). A Renewal Theorem. 
I>ahe Modi J 15(1): 145-50 Mr '48.* 

1895. BLANCHE, ERNEST E. (Chief 
Statistician, Logistics Division, General 
Staff, U. S. Army, The Pentagon; and Lec¬ 
turer in Mathematiesd Statistics, American 
University; Washington, D. C.). The 
Mathematics of Gambling. 8di Sei <Sb Malh 
46(401) :217-27 Mr '46.* 

1896. BOCHNER. S. (Professor of 


Mathematics, Princeton University, Prince¬ 
ton, N. J.). finitely Additive Set Ftmetions 
and Stochastic Processes. Proc Nat Acad 
8ci 32(10) :259-61 O '46.* 

1897. BOWIE, O. L. (Mathematician, 
Watertown Arseoal, Watertown, Mass.). 
A Least-Square Ap^cation to Relaxation 
Methods. J Appl Physics 18(9):830-3 S 
'47.* 

1898. BREYER, RALPH F. (Professor 
of Marketing, Wharton School of Finance 
and Commerce,University of Pennsylvania, 
Philadelphia 4, Pa.). Questions on Specifi¬ 
cations for Consumers’ Goods From the 
Point of View of Marketing, pp. 142-6, 
discussion 152-4. In Measurement of Conr 
sumer Interest {see 1903). 

1899. BRUNK, H. D. (Instructor in 
Mathematics, The Rice Institute, Houston, 
Texas). The Strong Law of Large Numbers. 
Duke Math J 15(1): 181-95 Mr '48.* 

1900. BUIST. J. M. (Rubber Service 
Department), and DAVIES, O. L. (Re¬ 
search Statistician, Dyestuffs Division). 
(Imperial Chemical Industries, Blackley, 
Manchester 9, England.) Statistical Evalu¬ 
ation of Variations in Rubber Processes 
and Correlation in Physical Properties. 
RiMer Chem da TedindL 20(1): 288-300 Ja 
'47.* Reprinted from Inst Rubber Ind 
Trans 22(2) :68-8l Ag '46.* 

1901. CATLIN, J. B. (Assistant Techni¬ 
cal Director), and STRIEBY, J. G. (Qual- 
ity Methods Coordinator). (Kimberly- 
Clark Corporation, Neenah, Wis.) Method 
for Evaluating and Reporting Product Con¬ 
trol. Tech Assn Papers 30:199-202 Je *47.* 
Same as 1463. 

1902. CHASSAN, JACK (Chief, Health 
Reports Branch, OfElce of the Surgeon Gen¬ 
eral, Department of the Army, Washing¬ 
ton, D. C.). Estimating Direct Labor Cost 
in Aircraft Production. Ind Axiotum 
3(l);66-f- Jl'46.* 

1903. CHURCHMAN, C. WEST (Asso¬ 
ciate Professor of Philosophy), ACKOFF, 
RUSSELL. (Assistant Professor of Philoso¬ 
phy), (Wayne University, Detroit 1, 
Mich.); and WAX, MURRAY (Instructor 
in Philosophy, Temple University, Phila¬ 
delphia 22, Pa.), Editors. Measurement of 
Consumer Interest. Philadelphia 4, Pa.: 
University of Pennsylvania Press (3622 
Locust St.), 1947. Pp. vi, 214. $3.50. 
(London E.C.4: Oxford University Press 
[Amen House, Warwick Square]. 20s.)* 
For contents, see 1887, 1898, 1906,1910-1, 
1917,1943,1950,1951. 

1904. COHEN, A. C. (Associate Profes¬ 
sor of Mathematics, University of Georgia, 
Athens, Ga.). Generalized Jn Functions. 



358 


AMBKICAN STATISTICAL ASSOCIATION 


Abstract. B Am Math See 64(3) :287 Mr 
*48.* 

1906. CRAWFOED. ALBERT B. (Di¬ 
rector), and BXJRNHAM, PAUL S. (Asso¬ 
ciate Director). (Student Appointment 
Bureau, Yale University, New Haven, 
Conn.) Chap. 2, The Measurement of 
Educational Performance and Basic Sta¬ 
tistical Principles, pp. 21-75. In their Pore- 
easting CoUege AchisDemeni: A Survey of 
Aptitude Tests for Higher Education: Pari 
I, General Considerations in the Measure¬ 
ment of Academic Promise, New Haven 7, 
Conn.: Yale University Press (143 Elm 
St.), 1946. Pp. xxi, 291. $3.76.* (London 
E.C.4: Oxford University Press [Amen 
House, Warwick Square]. 26a.) 

1906. CURTISS. J. H. (Chief, National 
Applied Mathematical Laboratories, Na¬ 
tional Bureau of Standards, Washington, 
D. C.). The Government’s Use of Specifica¬ 
tions for Consumers’ Goods, pp. 146-9, 
discussion 162-4. In Measurement of Con¬ 
sumer Interest (see 1903). 

1907. DALTON, A. G. (Superintendent, 
Quality Control, Western Electric Co., 
file., 196 Broadway, New York, N. Y.). 
Some Engineering Aspects of Qtudity Con¬ 
trol. Mech Eng 70(3): 205-7+ Mr *48.* 

1908. DEEMER, WALTER L., JR. 
(Lt. Col., U. S. Air Force, 400 Cameron 
Ave., Chapel Hill, N. C.), The Power of 
the t Test and the Estimation of Required 
Sample Size. J Ed Psychol 38(6):329-42 
0 *47.* 

1909. DEEMER, WALTER L., JR.; 
HORST, A. P. (Piofessor of Psychology, 
University of Washington, Seattle, Wash.); 
THORNDIKE, R. L. (Professor of Educa¬ 
tion, Teachers College, Columbia Univer¬ 
sity. New York. N. Y.); and WHITNEY, 
A. G. (Foreign Service Staff Officer, Divi¬ 
sion of Foreign Service Personnel, Depart¬ 
ment of State, Washington, D. C.). Ap¬ 
pendix E, The Correction of Correlation 
Coefficients for Restriction of Range, pp. 
699-16. In Records, Analysis, and Test 
Procedures. Edited by Walter L. Deemer, 
Jr, Army Air Forces, Aviation Psychology 
Program Research Reports, No. 18. Wash¬ 
ington, D. C.: Government Printing Office, 
1947. Pp. ix, 621. $2.26.* 

1910. DEMING, W. EDWARDS (Ad¬ 
viser in Sampling, Bureau of the Budget, 
Washington, D. C.). Random Samples and 
Biased Samples, pp. 167-70, discussion 
170-6. In Measurement of Consumer In- 
Urest (see 1903). 

1911. DEMING. W. EDWARDS. Some 
Criteria for Judging the Quality of Surveys, 
pp. 106-22. In Measurement of Consumer 
Interest (see 1903). 


1912. DEMING, W. EDWARDS, and 
SCATES. DOUGLAS E. (Professor of Edu¬ 
cation, D^e University, Durham, N, C.). 
The Need for Statistical Education in EQgh 
School and College. Ed Rec 29(1):72-80 
Ja *48.* 

1913. DURAND, DAVID (National 
Bureau of Economic Research, West 254th 
St. and Independence Ave., New York 63, 
N. Y.). An Appraisal of the Errors Involved 
in Estimating the Size Distribution of a 
Given Aggregate Income. R Scan & Stat 
30(1): 63-8 F *48.* 

1914. FINNEY, D. J. (Lecturer in the 
Design and Analysis of Scientific Experi¬ 
ment, University of Oxford, Oxford, Eng¬ 
land). The Adjustment of Biological Assay 
Results for Variation in Concomitant Ob¬ 
servations. J Hyg 45(4) :397-^06 D *47.* 

1915. FINNEY, D. J. The Principles of 
Biological Assay. Sup J Royal Stat Soc 
9(1-2) :46-81, discussion 81-91 *47.* 

1916. FINNEY, D. J. Statistical Science 
and Agricultural Research. Math Gazette 
31(293) :21-30 F *47.* 

1917. GUTTMAN, LOUIS (Associate 
Professor of Sociology, Cornell University, 
Ithaca, N. Y.). The Cornell Technique for 
Scale and Intensity Analysis, pp. 60-84. 
In Measurement of Consumer Irderest (see 
1903). 

1918. HASKINS, ELMER E. (Assist¬ 
ant Professor of Mathematics, Air Izistitute 
of Technology, Wright Field, Ohio). Qual¬ 
ity Control by Statistical Methods. Math 
Teach 41(4):172-+ Ap *48.* 

1919. HITT, HOMER L. (Professor of 
Sociology and Rural Sociology and Head of 
the Department, Louisiana State Uni¬ 
versity, Baton Rouge, La.). The Use of 
Selected Cartographic Techniques in 
Health Research. Social Forces 26(2): 189- 
96 D *47.* 

1920. HOCHSTIM, JOSEPH R. (Chief 
Statistician), and SMITH, DILMAN, 

M. K. (Vico President). (Opinion Research 
Corporation, 44 Nassau St., Piinceton, 

N. J.) Area Sampling or Quota Control?— 
Three Sampling Experiments. Pub Opinion 
Q 12(l):73-80 sp *48.* 

1921. HOLLEY, J. W. (Graduate Stu¬ 
dent, Department of Psychology, North¬ 
western University, Evanston, Ill.). A 
Note on the Reflection of Signs in the Ex¬ 
traction of Centroid Factors. Psyt^o- 
metrika 12(4):263-5 D *47* 

1922. HOLZINGER, KARL J. (Profes¬ 
sor of Education, University of Chicago, 
Chicago 37, Ill.). Factoring Factors. J Ed 
Psychol 38(6):321-S O *47.* 

1923. HOTELLING, HAROLD (Pro¬ 
fessor of Mathematical Statistics and Head 



STATISTICAL METHODOLOGY INDEX 


359 


of the Department; Associate Director, 
Institute of Statistics; University of North 
Carolina, Chapel Hill, N. C.). Points in the 
Report of the Institute of Mathematical 
Statistics Committee on the Teaching of 
Statistics. Afoih Teach 41(2) :7&-7 F *48 * 

1924. JAMBS, EDWIN (Associate Pro¬ 
fessor of Agronomy, University of Georgia, 
Athens, Ga.). A Modification of the Formu¬ 
lae for Regression. Am 8oc Agrtm J Z9(jSt) I 
546-fi Je '47.* 

1925. JONES, MELVIN D. (Assistant 
Professor of Genetics and Plant Breeding, 
Oklahoma Agricultural and Mechanical 
College, Stillwater, Okla.). The Fraction 
Method of Analysis of Factors: An Aid in 
Teaching Genetics. J Heredity 38(12): 
368-70 D '47.* 

1926. JOSEPH, A. W. (Assistant Actu¬ 
ary, Wesleyan and General Assurance So¬ 
ciety, Steelhouse Lane, Birmin^am 4, 
England). Third-Order Moments of a Bi¬ 
variate Frequency Distribution. J Inst 
Actuaries 73(337):427-9 D *47.* 

1927. JURKAT, ERNEST H. (Director 
of Research, Wroe Alderson, Alderson & 
Simons, Inc., 1905 Walnut St., Philadelphia 
2, Pa.; and Instructor in Marketing, Gradu- 
uate School of Business Administration, 
New York University, 90 Trinity Place, 
New York, N. Y.). Statistical Analysis and 
Interpretation of Chemical Market Re¬ 
search Data. Chem Ind 59(6): 1021-4 D 
*46.* 

1928. KIMBALL, BRADFORD F. 
(Senior Statistician, State Department of 
Public Service, 233 Broadway, New York 
7, N. Y.). Assignment of Frequencies to a 
Com]^etely Ordered Set of Sample Data. 
Trans Am Geophysical Union 27(6): 843-6 
D *46. Discussion by E. J. GUMBEL 
(Associate Professor of Mathematics, 
Brooklyn College, Brooklyn, N. Y.) 
28(0):961-2, author's reply 952-3 D '47.* 

1929. KOGAN, LEONARD S. (Assist- 
ant Director, Institute of Welfare Research, 
Community Service Society of New York, 
105 East 22nd St., Now York 10, N. Y.). 
Analysis of Variance—^Repeated Measure¬ 
ments. Psychol B 45(2): 131-43 Mr *48.* 

1930. KUZNETS, GEORGE M. (A^ 
date Professor of Agricultural Economics, 
University of California, Berkeley, Calif.). 
The Use of Econometric Models in Agri¬ 
cultural Micro-Economic Studies. J Farm 
Econ 30(1): 131-9, discussion 139-41 F 
*48.* 

1931. KUZNETS. SIMON (Professor of 
Economic and Social Statistics, University 
of Pennsylvania, Philadelphia 4, Pa.). 
Measurement of Economic Growth, pp. 
10-34. In The Tasks of Economic History \ 


Papers Presented at the Seventh Annual 
Meeting of the Economic History Associa¬ 
tion, New Haven, Connecticut, September 
12-13, 1947. Supplement 7 of the Journal 
of Economic History, New York 3: New 
York University Press (Washington 
Square), 1947. Pp. 174. Paper. $2.00.* 

1932. LASKER, GABRIEL W. (Assist¬ 
ant Professor of Anatomy, Wayne Uni¬ 
versity, Detroit 26, Mich.). Penetrance 
Estimated by the Frequency of Unilateral 
Occurrences and by Discordance in Mono¬ 
zygotic Twins. With an editorial note by 
CHARLES P. WINSOR (Assistant Pro- 
fessor of Biostatistics, School of Hygiene 
and Public Health, Johns Hopkins Uni¬ 
versity, Baltimore, Md.). Human Biol 
19(4) :217-23, note 223-30 D '47.* 

1933. LAWSHE, C. H., JR. (Professor 
of Psychology, Purdue University, Lafa¬ 
yette, Ind.), and MAYER, J. S. (Mayer 
and Co., Industrial Psychologists, 707 
Timos-Star Bldg., Cincinnati 2, Ohio). 
Studies in Item Analysis; I, The Effect of 
Two Methods of Item Validation on Test 
ReliabiHty. J Appl Psy<hol 31(3):271-7 
Je '47.* 

1934. LIENAU, C. C. (Associate Profes¬ 
sor of Electrical Engineering, Manhattan 
College, New York, N. Y.). Quantitative 
Aspects of Organization. Human Bid 
19(4): 163-216 D *47.* 

1935. MAINLAND, DONALD (Profes¬ 
sor of Anatomy, Dalhousie University, 
Halifax, Nova Scotia). Statistical Methods 
in Medical Research: I, Qualitative Sta¬ 
tistics (Enumeration Data). Can J Res, 
Sect E Med Sci 26(1): 1-166 F '48.* 

1936. ★MASSEY, FRANK J., JR. 
(Assistant Professor of Mathematics, Uni¬ 
versity of Oregon, Eugene, Ore.). Estima¬ 
tion of a Distribution Function by Confi¬ 
dence Limits. Doctor's thesis, Universily 
of California (Berkeley, Calif.), 1947, 

1937. MAY, KENNETH (Associate 
Professor of Mathematics, Carleton Col¬ 
lege, Northfield, Minn.). Probabilities of 
Certain Election Results. Am Math Mo 
65(4): 203-9 Ap '48.* 

1938. MAY, KENNETH. Variation of 
the Probability of Unfair Election Results: 
Preliminary Report. Abstract. S Am Mdh 
Soc 54(3):288 Mr '48,* 

1939. MORLEY, J. F. {formerly Head of 
Development Division); PORRITT, B. D. 
{late Director of Research); and SCOTT, 
J. R. (Director of Research). (Research 
Association of British Rubber Manufac¬ 
turers, 105-7 Lansdowne Road, Croyden, 
Surrey, England.) Reproduribility of Ten¬ 
sile, Permanent Set, and Hardness Test 



360 


AMERICAN STATISTICAL ASSOCIATION 


Results in the Same Laboratory. Rubber 
Chem <fc Tedhnol 20(2) :561-76 Ap *47.* 

1940. MOSER, C. A. (Assifitant Lecturer 
in Statistics, London School of Economics 
and Political Science, University of Lon¬ 
don, London W.C.2, England). Statistics 
and the Citizen. Highway 28:79 F ’47.* 

1941. NEYMAN.J. (Professor of Mathe¬ 
matics and Director of the Statistical 
Laboratory), and SCOTT. ELIZABETH 
L. (Research Assistant and Ajssociate, De¬ 
partment of Mathematics). (University of 
CaJifoiuia, Berkeley 4. Calif.) Consistent 
Estimates Based on Partially Consistent 
Observations. Econometrica 16(1): 1-32 Ja 
*48.* 

1942. PAGE, A. (Lecturer in Mathe¬ 
matics, Borou^ Road College, Ideworth, 
Middlesex, England). Chap. 12, Statistics, 
pp. 243-74. In his Algebra. London E.C.4: 
University of London Press Ltd. (Warwick 
Sguare), 1947. Pp. vii, 346. 18«.* 

1943. PEACH, PAUL (Associate Profes¬ 
sor, Institute of Statistics, North Carolina 
State College, Raleigh, N. C.). Laboratory 
Testing and Actual Quality in the Con¬ 
sumer Goods Field, pp. 138-42, discussion 
152-4. In MeasuremerU ef Consumer Inr 
t&rest (see 1903). 

1944. PER]^, WILFRED (Assistant 
Actuary of the Pearl Assurance Co., Ltd., 
London W.C.l, England). Some Observa¬ 
tions on Inverse Probability Including a 
New Indifference Rule. J Inst Actuaries 
73(337):285-312; discussion 313-34 D ’47.* 

1945. PETERSON, DONALD A. (Re¬ 
search Associate, Life Insurance Aig^cy 
Management Asrociation, 115 Broad St., 
Hartford 5, Conn.), and GULLIHSEN, 
HAROLD O. (Professor of Psychology, 
Princeton University; and Research Di¬ 
rector, College Entrance Examination 
Board; Princeton, N. J.). Appendix E-3, 


Construction and Use of Abacs, pp. 488- 
505. In Personnd Research and Test De¬ 
velopment in the Bureau of Naval Person¬ 
nd. By the Staff, Test and Research Section 
in Cwperation With N.D.R.C. Project 
N-106 and the College Entrance Examina¬ 
tion Board. Edited by Dbwbt B. Stuit. 
Princeton, N. J.: Princeton University 
Press, 1947. Pp. xxiv, 513. $7.50.* 

1946. RICHARDS, PAUL I. (Scientist, 
Physics Department, Theoretical Division, 
Brookhaven Nationsil Laboratory, Associ¬ 
ated Universities, Inc., Upton, N. Y.). 
Probability of Coincidence for Two Peri¬ 
odically Recurring Events. Ann Math Stat 
19(1): 16-29 Mr’48.* 

1947. RYNIN, DAVID (Assistant Pro¬ 
fessor of Speech, University of CaUfomia, 
Berkeley, Calif.). Probability and Meaning. 
J Philos 54(22):689-97 O 23 ’47.* 

1948. SALZER, HERBERT E. (Mathe¬ 
matician, Computation Laboratory, Na¬ 
tional Bureau of Standards, Washington, 
D. C.). Note on Interpolation for a Function 
of Several Variables. B Am Math Soe 
51(4):279-80 Ap’45.* 

1949. SALZER, HERBERT E. Tables 
of Coefficients for Interpolating in Func¬ 
tions of Two Variables. J Math Physics 
26(4):294-305Ja’48.* 

1950. THURSTONE, L. L. (Charles F. 
Grey Distinguished Service I^fessor of 
Psychology, University of Chicago, Chi¬ 
cago 37, Ill.). The Prediction of Choice, pp. 
39-55. In Measurement of Consumer Inter^ 
est (see 1903). 

1951. WILBB, S. S. (Professor of 
Mathematics and Director of the Section of 
Mathematical Statistics, Princeton Uni¬ 
versity, Pdnceton, N. J.). Research on 
Consumer Products as a Counterpart of 
Wartime Research, pp. 135-8, discussion 
152-4. In Measurement of Consumer Inter* 
est (see 1903). 



JOURNAL OF THE AMERICAN 
STATISTICAL ASSOCIATION 


Volume 43 


Septembeb 1948 


Number 243 


ARTICLES 

Significance of Inventories in the Current Economic Situation 

.Louis J. Pababiso 361 


Measuring Physical Inventories. Weib M. Bbown 377 


On the Determination of Sample Sizes in Designing Experiments • 

.Marilyn Harris, D. G. Horvitz and A. M. Mood 391 


Application of the Theory of Extreme Values in Fracture Problems . 

. Benjamin Epstein 403 


Statistics in Insurance 

IV. Problems with Sampling Procedures for Reserve Valuations 

.George C. Campbell 413 


V. Recent Developments in Graduation and Interpolation .... 

. Thomas N. E. Greville 428 


VI. Sampling Errors in Mortality and Other Statistics in Life Insurance. 

.Donald D. Cody 442 


The Edge-Marking Method of Analyzii»g Data . . L. L. Thurstone 451 


The War Production Board's Slatisticsd Reporting Experience, Parts II 

and III. David No vice and George A. Steiner 463 


Statistical Methodology Index, No. 13 . . Oscar Krisen Bubos 500 


BOOK REVIEWS 

Chapin, F. Stuart, Designs in iSfoczoZo^icaZ Research . . . 

.O. Kempthobne 489 


Hill, A. Bradford, Principles of Medical Statistics, Fourth Edition . . 

. J. Yerushalmy 492 













Hotelling, Habold; Bartkt, Walter; Deming, W. Edwards; Fried¬ 
man, Milton; and Hoel, Paul, The Teaching of Statistics, A Report of 
the Institute of Mathematical Statistics Committee on the Teaching 
of Statistics .Truman L. Kelley 493 

Pearson, E. S., chairman; R. G- D. Allen, H. Campion, William Elder- 
ton, C. Oswald George, R. F. George, M. Greenwood, D. Heron, 

J. O. Irwin, II. Leak, E. C. Rhodes, R. Stone, L. H. C. Tippett, and 
J. Wish ART, The Teaching of Statistics in Universities and Colleges, 
based upon a report by the Royal Statistical Society’s Committee on 
the Teaching of Statistics. Truman L. Kelley 493 

Levens, a. S., Nomography . M. Kac 496 

Samuelson, Paul Anthony, Foundations of Economic Analysis . . . 

. Gerhard Tintner 497 


Index to Volumes 1-34,1888-1939, may be obtained from the ASA. The Journal 
is also indexed in the Industrial Arts Index, the International Index to Periodi¬ 
cals, and the Public Affairs Information Service Bulletin. 







JOURNAL OF THE AMERICAN 
STATISTICAL ASSOCIATION 


Number S4S SEPTEMBER 1948 Volume 48 


SIGNIFICANCE OP INVENTORIES IN THE 
CURRENT ECONOMIC SITUATION 

Louis J. Paradibo* 

The Econometfic Institute, Inc. 

It is the purpose of this paper to: (1) appraise the current 
inventory level and recent additions in relation to other 
investment items in the economy and to the total value of 
output; and (2) analyze the current inventory levels relative 
to activity by broad industry groups. 

S INCE V-J Day, the course of inventory changes has led at times to 
optimism and at other times to apprehension regarding the future 
prospects of business. The inventory accumulation which occurred in 
the last half of 1945, for example, was salutary, since it reflected the 
gradual filling of the civilian pipelines which were depleted during the 
war years, and foreshadowed increased production of civilian goods. 
The rapid accumulation of inventories which occurred in the second half 
of 1946, on the other hand, was interpreted by many observers as a 
sign of impending decline in business activity. 

Those who are influenced in their business outlook by the course of 
inventory changes fail to assign to inventories their proper role in the 
business cycle. In theory, the part which inventories play in business 
fluctuations has been extensively developed. But theory is often cast 
aside when inventory accumulation reaches unduly large proportions. 
It is then argued that since inventory additions arc contributing signifi¬ 
cantly to the total value of output and since such a high rate of accumu¬ 
lation cannot go on indefinitely, that when the inventory accumulation 
stops, business activity must turn down unless the inventory loss is 
offset by rising activities elsewhere in the economy. This reasoning 
overlooks the fact that inventory changes are a symptom and not a 
prime cause of business fluctuations. The inventory position at the 
beginning of a change in the direction of the cycle can moderate or 
accentuate the movement; it can spell profits or loss for an individual 
firm; and it can prolong or shorten the cyclical swing. In this sense, the 

* Paper deUvered at the 107th Axmual Meeting of The American Statistical Association in New 
York City on December 28,1048. 
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appraisal of the trends and levels of inventories by kinds of business 
and by types of goods is of vital importance to business men as a guide 
to inventory policy. Inventory trends, however, are not the guides to 
use in forecasting the turning points of a cycle since the historical rec¬ 
ords indicate that they tend to lag behind a number of other basic eco¬ 
nomic factors in the timing of the turning points. 

THE MAGNITUDE OP RECENT INVENTORY CHANGES IN THE ECONOMY 

Although inventory accumulation proceeded at a rapid rate soon 
after V-J Day, it was not until the second half of 1946, following the end 
of price controls that the rate of accumulation became so great as to 
cause considerable concern. During the last quarter of 1946, business 
added a physical volume of inventories in the prices current at the time 
amounting to the annual rate of 5.5 billion dollars. In book value, this 
represented an addition of an annual rate of 16 billion dollars. Thus, the 
current value of additions to inventories in that quarter accounted for 
nearly one-fifth of the total gross private domestic investment. Further¬ 
more, this rate of accumulation meant that nearly 5 per cent of all 
goods (excluding services) produced in the economy during that quarter 
were for additions to existing stocks. 

In ordinary times such a rise would be symptomatic of trouble brew¬ 
ing elsewhere in the economy, such as in reduced buying power of indi¬ 
viduals and in distorted price, wage, and profit structures. Analysis of 
the basic economic developments in the past year, however, shows that 
demands for all t 3 ^s of goods were on the increase, that while distor¬ 
tions were appearing in the price structure, the problem businessmen 
were facing was to get a balance in inventories and to get enough goods 
of the right quality, style, and color to satisfy the ever-growing demands 
of consumers and producers, both domestic and foreign. The high rate 
of inventory accumulation experienced in the fourth quarter of 1946 
did not continue; nor did this spell recession. As better balance was 
obtained in the inventory holdings, supplies of goods flowing to ulti¬ 
mate consumers increased and the reduction in the rate of accumulation 
from 5.5 billion dollars in the fourth quarter of 1946 to less than one 
billion dollars in the fourth quarter of 1947, as shown in Table I, was 
offset by rising consumer expenditures, net foreign balance, and pro¬ 
ducers’ durables. The point commonly overlooked is that even though 
industrial production failed to rise materially during most of 1947, 
more goods were flowing to the ultimate consumer than in 1946 as less 
goods were being channeled into inventory additions. 

Let us examine in some detail the inventory position for the major 
groups of industries—^manufacturing, wholesale, and retail trades. 
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METHODS OF ANALYZING INVENTORIES 

The usual method businessmen employ in appraising the inventory 
position is to compare the current stock sales ratio with that in a period 
which would be considered as normal. This might be the average of a 
pre-war period of years of good business, or 1940 or 1941. This method 
has the advantage of simplicity, is easily understood and takes little 
time to develop. It is useful in many cases where data are not available 
over a long period and a more refined analysis cannot be made and where 
the current ratios are substantially different from the period chosen as 
normal. 


TABLE I 





Annual rates from 


1940 

1941 

4th Quarter 4th Quarter 




1946 

1947 



(BiUions of Dollars) 


Change in Inventonea: 





Current Valuation 

2.3 

3.9 

5.5 

0.6 

Book Value 

2.4 

7.2 

16.0 

8.0 

Gross Private Domestic Investment: 

13.0 

17.2 

30.4 

29.9 

Gross National Product: 

100 5 

125.3 

218.6 

240.9 

Sattos: 



(Perceniaffes) 


Change in Inventones to Gross Private Investment: 




Current Value 

17.7 

22.7 

18.1 

2.0 

Book Value 

18.5 

41.9 

52.6 

26.8 

Change in Inventories to Gross National Product: 




Current Value 

2.3 

3.1 

2.5 

0.2 

Book Value 

2.4 

5.7 

7.3 

3.3 


Source: V. S. Department of Commerce. 

The stock-sales ratio, however, has two distiact disadvantages and 
on that account can be misleading and result in a wrong and costly 
policy for buyers and merchandisers. First, the stability of the stock- 
sales ratio assumes that a given percentage change in sales is accompa¬ 
nied by a similar percentage change in stocks. For example, those who 
use the stock-sales ratio reason as follows: if it is assumed that a stock- 
sales ratio of 3 is considered normal in a year such as 1941, then a 
ratio of 2 in 1947, under conditions of substantially greater sales would 
imply thinness of inventories. This conclusion may or may not be the 
case. Further anal3rsis might show that in the particular bu^ess a 
change of ten per cent in sales is accompanied by a change of only 5 
per cent in inventories and therefore with greater sales, the stock-sales 
ratio should be lower in order that stocks be just adequate to support 
the larger volume of sales. In other words, in this example, the stock- 
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sales ratio of 2 would be in line with the historical experience of the 
firm and would indicate a balance rather than thinness in stocks. Sec¬ 
ond, use of the stock-sales ratio assumes that the stocks at any period 
of time must bear a fixed relation to the rate of sales in that period. 
Actually, inventory changes are the resultant of dynamic factors and 
are influenced by changes in activity of previous periods as well as long¬ 
term changes in turnover and merchandising policies of business firms. 

A more fruitful and logical technique for analyzing inventories is 
that of developing the relationship of inventories to the basic economic 
factors affecting their fluctuations. This is the approach used in this 
paper. 

THE DATA 

Overall inventory data are not available on a monthly basis before 
1939. Year-end data therefore must be relied upon for the earlier years. 
Even so, the existing data are not complete for all years. Census infor¬ 
mation is available for some years; data on department store stocks for 
all years back to 1919; the Bureau of Internal Revenue is the source 
for value of inventories held by corporations. In this report, the esti¬ 
mates made by Simon Kuznets and the National Bureau of Economic 
Research have been relied on primarily for the data before 1939. 
These estimates have been spliced to the Department of Commerce 
data beginning with December 31, 1938. The benchmark used by the 
Department was the Census of Manufactures and Census of Business 
for 1939 which contain data on the value of inventories at the beginning 
and the end of 1939. Since 1939 monthly data have been compiled by the 
Department on the basis of a representative sample of firms covering 
manufacturers, wholesalers and retailers. 

In general, the values of inventories analyzed are book value figures 
which reflect not only the current value of physical additions but also 
revaluations resulting from price changes. In view of the fact that price 
changes also affect sales or the value of output, the book value is the 
more appropriate measure to relate to the value of output or sales. 

It is diflicult to appraise the reliability of existing data on inventories. 
Book value figures reflect the various practices of firms which are not 
uniform from one industry to another or even within an industry. 
Various tests have indicated that, in general, existing sources tend to 
provide less reliable data on the UveU of inventories but are much more 
accurate with regard to the movements or changes. Since the analyas 
presented in this report is concerned with the development of historical 
relationships involving two or more variables, the accuracy of the level 
of inventories is not as important as the reliability of the size and direc¬ 
tion of the inventory change. 
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manufacturers' inventories in relation to production 

It has been found that the value of manufacturers' inventories is de¬ 
pendent not only on the value of output during a given period but also 
on the value of output in a previous period and on a gradual trend of the 
residuals of inventories after eliminating the influence of changes in 
the value of output. 

Using the data for the years 1920 to 1941, the regression of invento¬ 
ries on value of products is as follows: 

(1) =3.45 - .0122«+.0661v,+.0633i;p, 

where 

average value of manufacturers' inventories (arithmetic average 
of values at end of current and previous years), 
t =2(year-1931)+1, 

t;c=value of manufactured products in current year, 
t;p=value of manufactured products in previous year. 

The values are in billions of dollars. 

Table II and Chart I compare the actual inventories with the values 
calculated from the value of product according to regression (1), The 
fit is obviously very good. For the period 1920-41 the average percent¬ 
age errors (ratio of the residuals to the calculated values) is only 2.5 
per cent; with a maximum error of 8 per cent in 1920; the multiple cor¬ 
relation coeflBicient is 97 per cent.^ 

The fact that the inventory changes can be estimated very closely 
from changes in the value of output in the previous and current year 
provides a standard or a gauge for appraising the current inventory 
position relative to the value of production. 

The formula implies the following throe results based on the expe¬ 
rience of the pre-war period, 1920-1941: 

1. Everything else being equal, the value of manufacturers' 
inventories declined on the average by 24 million dollars per year. 
This has resulted from a steady improvement in the flow of mate¬ 
rials and from increased eflicioncy and better planning on the part 
of management, particularly the gradual introduction of inventory 
control S3rstems whereby the inventory units are geared to the 
rate of production. 

2. Other factors being equal, an increase (or decrease) of 1 bil¬ 
lion dollars in the value of output in the current year has been asso- 


1 If the value of output of the previous year is excluded fiom the analysis, the multiple coirelation 
coefficient is 91 per cent. 
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dated with a rise (or decline) of 660 million dollars in the average 
value of inventories. 

3. Assuming the same current year value of output and no 

TABLE II 

Value of Inventories of Manufacturing Industries 



Total 

Non-durables 

Calculated 

Inventories 

Per Cent of Residual 
to Calculated 


Value 

of 

Output* * * § 

Average 
Inven¬ 
tories t 

Value 

of 

Outputl 

Average 
Inven¬ 
tories t 

Totals 

Non- 

durablesIT 

Total 

Non¬ 

durables 



Bilhons of Dollars 



1920 

69.5 

11.2 

33.5 

5.8 

12.1 

6.1 

- 8.0 

- 4.9 

1921 

41.6 

10.3 

26.6 

5.2 


5.4 

- 4.8 

- 3.9 

1922 

50.0 

9.8 

28.0 

5.1 

9.6 

5.2 

2.0 

0 

1923 

58.2 

11.0 

32.9 

5.7 

10.6 

5.6 

3.7 

1.8 

1924 

63.5 

11.5 

30.5 

5.8 

10.8 

5.6 

6.7 

3.6 

1925 

60.8 

11.6 

35.1 

5.9 

11.0 

5.8 

6.7 

1.7 

1926 

62.5 

11.8 

36.0 

6.2 

11.5 

6.1 

2.5 

1.6 

1927 

60.3 

11.5 

35.8 

6.1 

11.5 

6.1 

0 

0 

1928 

61.2 

11.3 

36.5 

6.2 

11.4 

6.1 

0 

1.6 

1929 

68.0 

11.7 

38.4 

6.4 

11.8 

6.3 

- .9 

1.6 

1930 

51.0 

11.2 

32.5 

6.0 

11.1 

5.9 

.9 

1.7 

1931 

39.8 

9.5 

26.0 

5.1 

9.3 

5.1 

2.1 

0 

1932 

28.5 

7.6 

17.5 

4.2 

7.8 

4.2 

- 2.6 

0 

1933 

30.6 

7.2 

21.0 

4.0 

7.2 

4.1 

0 

- 2.4 

1934 

37.0 

7.8 

24.0 

4.5 

7.8 

4.4 

0 

2.3 

1935 

45.0 

8.2 

28.1 

4.7 

8.6 

4.8 

- 4.9 

- 2.1 

1936 

53.0 

9.2 

31.0 

5.1 

9.7 

5.2 

- 6.4 

- 1.9 

1937 

60.7 

10.6 

34.8 

5.7 

10.6 

5.3 

0 

7.5 

1938 

47.0 

10.5 

31.5 

5.6 

10.2 

5.5 

2.9 

1.8 

1939 

56.8 

10.2 

33.6 

5.5 

10.0 

5.5 

2.0 

0 

1940 

66.0 

11.2 

36.4 

5.8 

11.2 

5.8 

0 

0 

1941 

93.5 

13,5 

47.4 

6.6 

13.6 

6.7 

- 0.7 

- 3.0 

1942 

121.2 

17.0 

56.4 

7.8 

17.1 

7.9 


- 1.3 

1943 

148.5 

17.6 

62.3 

7.9 

20.6 

8.7 

-17.0 

- 9.2 

1944 

156.3 

17.8 

66.9 

7.8 

22.8 

9.3 

-31.8 

-16.1 

1945 

140.2 

16.4 

69.1 

8.1 

22.3 

9.7 

-36.0 

-16.5 

1946 

125.7 

18.3 

74.1 

9.3 

20.2 

10.1 

10.4 

- 6.9 

1947 

168.8 

22.1 

95.2 

10.8 

22.2 

11.9 

0.5 

- 9.2 


* Source: 1919-1939, census years from Dept, of Commerce. 1939-1947 from Dept, of Commerce* 
For inter-census years interpolated, see Appendix—^Table 1. 

t Source: 1929-1947 from Dept, of Commerce. 1919-1929 from National Bureau of Economic 
Research. 

t See Appendix—^Table 2. 

§ Calculated from Average Value of Manufacturing Inventories Bs3.45—.0122t+.0661 (value of 
current year in billions of dollars) +.0633 (Value of Product previous year in biiiinTiB of doUars) where 
f*2 (year—1931)+1. 

IT Calculated from Average Value of Non-durable Goods Inventories *1.67-.00986*+.0756 (cur¬ 
rent year value of product billions of dollars) +.0460 (preceding year value of product billions of dol¬ 
lars), where * =2 (year—1931) +1, 

Note: 1939-1947 the value of output is represented by the value of shipments which differs only 
by small amount from the former. 
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change in other factors, if the value of output in the previous year 
had been 1 billion dollars greater (or smaller), the average inven¬ 
tories in the current year would have been 630 million dollars 
greater (or smaller). 

The fact that the value of output of the previous year is about equal 
in importance to that of the current year in its influence on inventories 
implies that the current level of inventories is determined approximately 
by the rate of the value of output six months earlier. TMs amount of 
lag m inventory relative to sales is applicable to manufacturing indus¬ 
tries as a group and undoubtedly varies industry by industry and com¬ 
pany by company. 

What is the current position of manufacturers’ inventories relative 
to the pre-war relation of inventories to production? On the basis of 
the 1947 value of output of 169 billion dollars and that of 1946 of 126 
billion dollars, the average value of inventories in 1947 is calculated to 
be 22.2 billion dollars according to formula (1). The actual average for 
the year is estimated at 22.1 billion dollars, or about equal to the amount 
warranted by the value of output. Thus manufacturers’ inventories 
were in line in relation to the pre-war stocks-production relation during 

CHART I 
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1947.2 The rise in the value of these inventories of about 600 million 
dollars which occurred during the final quarter of 1947 did not bring 
stocks out of line with the value of products since the latter also in¬ 
creased from the previous quarter. From September to October, for 
example, the value of products increased by 8 per cent while the value 
of inventories rose by less than 1.6 per cent. Thus the value of manu¬ 
facturers’ inventories at the close of 1947 were in balance on the basis 
of the pre-war stocks-output relations. This condition, however, varies 
by industries within the manufacturing group. 

When manufacturers’ inventories are analyzed by the two major 
classifications of industries—durable and non-durable goods—^it is 
found that in the case of non-durable goods the value of inventories in 
1947 was deficient relative to the value of product to the extent of over 
1 billion dollars. This is derived from the pre-war relationship of the 
value of inventories of the non-durable goods industries to value of 
products of the current year and the preceding year. The formula de¬ 
rived from the data for the period 1920-1941 is as follows for non-dur¬ 
able goods manufacturing industries; 

(2) t;<n = 1.67-.00986i-|-.0756t;.-h.046i;p, 

where the symbols have the same meanings as in equation (1), except 
that Vin refers to non-durable goods. 

Where 2 (year—1931)-4-1—^average value of non-durable goods 
inventories equals arithmetic average of end of current year inventories 
with end of preceding year. 

As Table II and Chart II show, the formula approximates the actual 
value of inventories very closely, the average error for the period 1920- 
1941 being 2 per cent, while the index of determination is 98 per cent.® 

The fact that inventories are currently not up to the levels warranted 
by production indicates (1) that supplies of certain goods were hard to 
get and stocks could not be built up to required levels and (2) that man¬ 
ufacturers of non-durables deliberately kept their inventories at a mini¬ 
mum in a period when sales were at record levels. 

In contrast to the non-durable goods industries, inventories of the 
durable goods group are currently over one billion dollars above the 
amount indicated by the pre-war relationship of inventories to value of 
product of these industries. This is implied from the results obtained 


* It may be noted tbat in 1940 when inventories were exactly in Tfaft with the relationship to value 
of products as given by formula (1), the stock-sales ratio was .170 whereas in 1947 the ratio was .131. 
On this basis it would appear that inventories in 1947 were not up to the 1940 position relative to value 
of products. 

»If the value of output of the previous year is excluded from the analysis the index of correlation 
is 95 per cent. 
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by using formulas (1) and (2). At first sight, this would seem to indicate 
some trouble for these industries. Actually, the reason for the “exce&- 
sive” inventories in these industries is that there is considerable un¬ 
balance in the type of inventory holdings so that many companies hold 
large stocks of certain parts and components while they lack sufficient 
supplies of other goods needed to achieve an expanding volume of out¬ 
put. This means that the flow of materials has been uneven and part 
of the final output was held down because of inability to get a balanced 
supply of materials going into production. This situation has persisted 
ever since V-J Day and has been a major factor in limiting the flow 
of much needed durable goods to the ultimate consumers. 

WHOLESALE INVENTORIES 

Inventories held by wholesalers are much smaller than those held by 
retailers or manufacturers in relation to the volume of business done. 
The ratio of wholesale inventories to wholesale sales in October 1947 
was .45; for retail, it was 1.07; and for manufacturing, it was 1.49. 
Furthermore, the change in inventories is not so great in wholesale for 
a given change in business as it is in manufacturing or retail. In other 
words, wholesalers have a more rapid turnover and can get along with 
relatively smaller stocks. 
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As in the case of manufacturing, the value of wholesale inventories 
is related to the value of current yearis sales, previous year’s sales and 
a time factor. 

The specific formula for the period 1930-1941 is as follows: 

(3) 1.04-.032f+.0222s.+.02726sp, 

where 

average value of wholesale inventories (average of values at the 
beginning and end of the current year), 
i=2’(year-1936)+l. 

Sc = current year’s sales, 
previous year’s sales, 

and all money figures are in billions of dollars. 

This relationship shows that in fact the level of the previous year’s 
sales has a greater weight than that of the current year. As Table III 
and Chart III show, the average percentage error for the 12 year period 
is 1.8 per cent; the index of correlation is .999.^ The 1947 average value 
of wholesale inventories estimated at 6.5 billion dollars was 0.9 billion 
dollars lower than the amount calculated on the basis of the relationship 
to sales. Thus in contrast to manufacturing, it appears that the value 
of inventories held by wholesalers is below the economically warranted 
levels assuming the continuation of pre-war relationships. 

RETAIL INVENTORIES 

Of especial interest to businessmen is the appraisal of retail inven¬ 
tories since retailers are the first to feel the impact of changing con¬ 
sumer buying, price resistance, and shifting tastes. Many are concerned 
that retail inventories are rising at uncomfortably rapid rates and that 
such a development might spell danger in the near future. 

At this point it is well to distinguish between the appraisal of retail 
inventories in the aggregate as against the inventory holdings of par¬ 
ticular items by individual firms. In the former case, the impact of ex¬ 
cessive or deficient inventories is on the economy as a whole, while in 
the latter case it affects management’s policy decisions to purchase, 
order, or liquidate goods. In this analysis, we are concerned with the 
broader aspects of inventories on an industry-wide basis. This will pro¬ 
vide retailers with the facts on the over-all inventory position of their 
industry. It also provides a technique and method which each retailer 
can apply to analyze the inventory-sales problem for his particular 
conunodities. 


* IS the value of output of the previous year is excluded from the analysis the index of correlation 
is .98. 
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1930 1932 1934 1936 1938 1940 1942 1944 1946 1948 

The average value of retail inventories during a year is related to re¬ 
tail sales of the previous year. In addition, after allowing for the level 
of sales, the value of retail inventories has shown a declining trend in 
the period 1920-1941, Specifically, the regression equation relating 
inventories to sales for the period 1920-1941 is as follows: 

(4) t;.v=6.5-.C38f+.815sc+.5075y, 
where 

average value of retail inventories (averages of values at the end 
of the current and previous years), 
t—2 (year-1931)+1, 

Sc=retail sales in the current year, 
retail sales in the previous year. 

Values and sales are in billions of dollars. 

where <==2(year—1931)+!; average value of inventories is the arith¬ 
metic average of end of current year and end of previous year in¬ 
ventories. 
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TABLE III 


TRADE INVENTORIES 



Retail Trade 

Wholesale* 

Calculated 

Inventories 

Per Gent of Residual 
to Calculated 


Sales* 

Value of 
Inven¬ 
tories t 

Sales* 

Value of 
Inven- 
toriest 

Retailt 

Whole¬ 

sale! 

Retail 

Whole¬ 

sale 


Billions of Dollars 


1920 

43.0 

7.45 



7.38 


.9 


1921 

33.8 

6.78 



6.80 


- .3 


1922 

34.3 

6.04 



6.25 


- 1.8 


1923 

39.3 

6.48 



6.55 


- 1.1 


1924 

39.4 

6.83 



6.69 


2.1 


1925 

42.9 

6.97 



6.85 


1.8 


1926 

45.7 

7.19 



7.13 


.8 


1927 

44.8 

7.06 



7.07 


- .1 


1928 

46.0 

6.96 



6.99 


- .4 


1929 

48.5 

7.15 



7.13 


.3 


1930 

42.0 

6.61 

53.7 

4.48 

6.60 

4.41 

.2 

1.7 

1931 

34.8 

5.40 

40.6 

3.64 

5.55 

3.70 

- 2.7 

- 1.8 

MM 

25.0 

4.29 

30.8 

2.95 

4.26 

3.06 

.7 

- 3.5 

Bi] 

24.5 

3.81 

30.0 

2.74 

3.60 

2.71 

6.8 

1.2 

MM 

28.7 

3.91 

33.4 

2.84 

3.79 

2.70 

3.2 

5.2 

1935 

32.8 

4.09 

42.8 

2.87 

4.20 

2.94 

- 2.6 

- 2.5 

1936 

38.3 

4.50 

52.2 

3.26 

4.74 

3.34 

- 6.1 

- 2.5 

1937 

42.2 

4.97 

57.6 

3.72 

5.20 

3.65 

- 4.4 

1.8 

1938 

38.1 

5.06 

50.0 

3.56 

4.94 

3.56 

- 2.4 

0 

1939 

42.0 

5.00 

55.3 

3.43 

4.92 

3.41 

1.6 

0.5 

1940 

46.4 

5.34 

61.8 

3.64 

5.35 

3.64 

- .2 

0 

1941 

55.5 

6.29 

83.6 

4.21 

6.19 

4.23 

1.6 

0.5 

1942 

57.6 

6.94 

93.2 

4.34 

6.69 

4.98 

3.7 

-12.9 

1943 

63.7 

6.69 

99.3 

3.98 

7.17 

5.31 

6.7 

-25.0 

1944 

69.5 

6.44 

103.4 

3.98 

7.82 

5.50 

-17.6 

-27.6 

1945 

76.6 

6.36 

105.4 

4.14 

8.57 

5.60 

-25.8 

-26.1 

1946 

96.8 

7.83 

131.6 

5.11 

10.45 

6.17 

-25.1 

-17.2 

1947 

109.5 

10.09 

158.4 

6.50 

12.38 

7.42 

-18.5 

-12.4 


* Source: Department of Commerce 1929-1947.1919-1929 based on interpolation with consumers, 
expenditures data on goods* department store sales, etc. 

f Source: 1929-1947 from Dept, of Commerce. 1919-1929 from National Bureau of Economic 
Research. 

t Calculated from Average Value of Retail Inventories »6.5 —.6382 +816 (retail sales current year 
bills, of 8 ) +.507 (retail sales previous year bills, of 8 ), where t **2 (year—1931) +1. 

S Calculated from average value of Whol^le Inventories »1.04—.0322+.0222 (current year 
sales bills, of 8 ) +.02726 (previous year’s sales bills, of 8 ), where 2 *2 (year—1936) +1. 

This relationship shows that the level of current year retail sales 
has over one and a half times as much weight as the retail sales of the 
previous year in its effect on inventories. As Table III and Chart IV 
show, the correlation is very tight for the period with an average an¬ 
nual error of 2 per cent for the 22-year period, and a maximum residual 
of only 6 per cent in 1933. The index of correlation is .99.® 


B If the value of output Is excluded from the analsrsis the index of correlation is 95 per cent. 
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CHART IV 


13 I 


RETAIL INVENTORIES 

ACTUAL AND CALCULATED 


ACTUAL 







CALCULATED 




In sharp contrast to the inventory position in manufacturing, retail 
inventories currently are significantly below the amount calculated 
from sales on the basis of formula (4). The average value of retail in¬ 
ventories in 1947 was 10 billion dollars whereas on the basis of 1947 
retail sales of 109.6 billion dollars and of 1946 sales of 96.8 billion dol¬ 
lars, formula (4) yields a calculated value of inventories of 12.4 billion 
dollars. This means that retailers today are operating with a sig¬ 
nificantly smaller inventory than is indicated by the pre-war relation 
of sales to inventories. There are three reasons for this: first, many 
goods arc still short in supply and retailers cannot accumulate suf¬ 
ficient quantities to bring stocks up to the pre-war relation of stocks 
to sales. Examples of relative shortages in the consumer durable goods 
field are of course well-known. To some extent this situation also pre¬ 
vails in certain soft goods. One of the large department stores in New 
York, for example, foimd recently that of the customers who failed to 
buy anything at all, a significant proportion wanted goods which the 
store did not have on hand. In other words, if certain goods had been 
more plentiful, sales of many retail stores would have been greater. 
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Second, many retailers have been pursuing a deliberate policy of hold¬ 
ing inventories down to a minimum. They are uncertain of the future 
because they have never experienced such high levels of sales and they 
do not want to be caught with high-priced stocks when competition 
begins in earnest. Third, the economy since V-J Day has been operating 
under conditions of relatively full employment with a concomitant high 
level of retail sales. A characteristic feature of this period was the short¬ 
ages of many goods relative to demand. Competition has only been a 
factor in recent months. Since retailers could sell practically an^dihing 
they had, the turnover rate has been greater than in the pre-war 
period and they did not find it necessary to accumulate inventories to 
conform with pre-war stock-sales relation in order to achieve a high 
rate of sales. 

DEPARTMENT STORE STOCBB 

The data on inventories by kinds of retail business are not sufficiently 
reliable over a long enough period to permit a correlation analysis with 
sales. However, analysis of the sales-inventory relation for depart¬ 
ment stores is probably typical of other major types of non-durable 
goods stores. 

CHART V 
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APPENDIX—TABLE 1 


Method for Intel polating Value of Products in Manufacturing 



(1) 

Value of 
Products 

(2) 

FRB 

Manufacturing 

Production 

(3) 

BLS 

Prices Other 
Than Farmer 

(4) 

Index of 
Manufacturers' 
Products 

Year 

(Bills. Dols.) 

(1935-39=100) 

1919 

60.2 


161.6 

116.4 

1920 

69.5 


190.1 

140.7 

1921 

41.6 


122.9 

68.8 

1922 

50.0 

74 

119.5 

88.4 


58.2 

86 

123.9 

106.6 


53.6 

81 

119.2 

96.6 

1925 

60.8 

90 

124.5 

112.0 

1926 

62.5 

95 

122.8 

116.7 


60.3 

04 

116.2 

109.2 

1928 

61.2 

99 

116.4 

115.2 

1929 

68.0 

110 

114.6 

126.1 

1930 

51.0 

90 

105.5 

95.0 

1931 

39.8 

74 

91.6 

67.8 

1932 

28.5 

67 

83.9 

47.8 

1933 

30.6 

68 

84.7 

57.6 

1934 

37.0 

74 

94.4 

60.9 

1935 

45.0 

87 

98.5 

85.7 


63.0 

104 

99.1 

103.1 


60.7 

113 

105.8 

119.6 

1938 

47.0 

87 

99.0 

86.1 

1939 

56.8 

109 

97.6 

106.4 


Note: Value of Products in odd years from Census; even years interpolated by regresmon of value 
of manufacturer’s products of Column 4 and census value. 


The value of inventories held by department stores is related to the 
current year sales of department stores and sales of the previous year 
according to the following regression developed for the period 1920- 
1941 ^ 

(5) t;M=1.4-.993t+.8864Sc+.25398« 
where 

»«=average value of inventories of department stores relative to 
1935-39=100, 
t=2 (year-1931)+1, 

8e=department stores’ sales in the current year (1935-^9=100), 
«p=department stores’ sales in the previous year (1935-39=100). 

Here again, as Chart V shows, the degree of correspondence is very 
close with an aimual average error for the entire period of 2 p^ cent 

* The data on inventories and sales are based on published indexes of the Board of Govenunrs of 
the Federal Reserve System in various bulletins of that agency* 
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and an index of correlation of 97.3 per cent. On the basis of an index of 
department store sales average of 281 for 1947 and 264 for 1946 (1936- 
39 = 100), the index of the value of department store inventories is 
calculated to be 285. This compares with the estimated actual index of 
inventories of 253. Thus, currently, department store stocks are low 
when appraised in terms of the pre-war stocks-sales relationship. Chief 
factors in this situation are the deliberate policy of department store 
executives to hold inventories down to a minimum and inability to get 
adequate supplies of many goods. 

APPENDIX—TABLE 2 


Method for Interpolating the Value of Products in Manufacturing 
Non-durable Goods Industries 


Year 

(1) 

Production of 
Non-durable 
Goods 

(2) 

Wholesale 
Prices in 
Non-durables 
1939 »100 

(3) 

Value of 
Non-durable 
Manufacturers 

(4) 

Value of 
Products 

1919 

62 

186.6 

153 

36.3 

1920 

60 

218.0 

131 

33.5 

1921 

57 

139.4 

79 

26.6 

1922 

67 

132.4 

89 

28.0 

1923 

72 

135.7 

98 

32.9 

1924 

69 

132.3 

91 

30.5 

1925 

76 

140.7 

107 

35.1 

1926 

79 

137.9 

109 

36.0 

1927 

83 

128.0 

106 

35.8 

1928 

85 

129.3 

110 

36.5 

1929 

93 

126.0 

117 

38.4 

1930 

84 

114.7 

97 

32.5 

1931 

79 

97.4 

77 

26.0 

1932 

70 

86.1 

60 

18.5 

1933 

79 

87.4 

69 

21.0 

1934 

81 

98.7 

80 

24.0 

1935 

90 

105.1 

95 

28.1 

1936 

100 

106.1 

106 

31.0 

1937 

106 

112.7 

119 

34.8 

1938 

05 

101.9 

97 

31.5 

1939 

109 

100.0 

109 

33.6 


Column 1: Federal Reserve Board. 

Column 2: Obtained by combining BLS components of non>durable goods prices by BLS weights. 
Column 3: Product of Column 1 and 2. 

Column 4: Odd years—Census of Manufactures; even years—^interpolated by a regression be¬ 
tween Column 3 and census values for the odd years. 



















MEASURING PHYSICAL INVENTORIES* 


Weir M. Brown 

Board of Governors, Federal Reserve System 
SUMMARY 

This paper seeks to emmine some of the problems encoun¬ 
tered in measuring and interpreting the physical volume of 
inventories and to suggest some methods of approaching these 
problems. Public attention to the subject of inventory changes 
certainly has not been lacking in the period since VJ-Day. In¬ 
deed, the growth in inventories has to some persons appeared 
so sdgnihcant that the period is sometimes characterized 
simply—^probably too simply—^as an “inventory boom.” 
This paper will not attempt to describe the course of recent 
inventory fluctuation, aside from a few isolated illustrative re¬ 
marks in passing. Rather, this opportunity will be taken to 
scrutinize the fundamental paths of approach to a fleld in 
which existing knowledge must still be regarded, by any sober 
standard, as limited. 

T he term “inventories,” as ordinarily used, is applied to the stocks 
of finished goods held by business firms for sale and to stocks of 
raw or semi-fihished materials held either for sale or for processing into 
marketable goods of another form. Thus in customary usage the term 
refers principally to stocks in the hands of manufacturers and distribu¬ 
tors engaged in making and selling tangible goods in the market. It is 
not always perceived that inventories in actuality are held also by 
service and extractive industries, government organizations, and indi¬ 
vidual consumers, and that conceptually the idea of inventory is uni¬ 
versal. In this paper, the term “inventory” is employed in a broad sense 
to refer to any stock of xmconsumed goods, in whatsoever position in 
the economy it may be held. 

If we imagine for a moment a hypothetical economy which is per¬ 
fectly static—^in which industrial techniques and consumer tastes are 
fixed, the volume of employment and production coAstant, and with 
even seasonal variations spirited out of existence—^it is obvious that in 
such a state the amounts and kinds of goods held in inventory likewise 
would have become perfectly constant, having adjusted themselves to 
the technical requirements of an imchanging level of consumption of 
raw materials on the one hand and a fixed production and consumption 
of finished products on the other. 


* Paper delivered at the Annual Meeting of the American Statistical Association, New York City, 
December 28,1947. 
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But in the real world, the quantities of goods held in stock in various 
sectors of the economy are not constant but fluctuate. That is the pri¬ 
mary reason why inventories hold any interest for us or present any 
analytical challenge. An increasing demand for the products of one 
industry which results in expanded output will ordinarily operate to 
increase the work-in-process stocks of the industry and may also affect, 
in one direction or the other, the size of the materials and finished prod¬ 
ucts inventories as well, although the short-run effects (on, say, finished 
goods stocks) may be quite different from the long-period effects. Of 
equal importance with changes in demand are vajying supply condi¬ 
tions, whether of a periodic or occasional nature. When the market 
changes represent broad forces affecting the general state of demand or 
conditions of supply, the shifts imposed on the compodtion and size 
of inventory holdings may be very marked and complicated. In turn, 
the line of influence may flow in the other direction, for inventory 
levels and movements therein may themselves bear importantly on the 
econonede mtuation. 

When we concentrate on the change aspect, it is perhaps easier to 
elucidate the meaning of inventories and to understand their relevance 
to the functioning of the system generally. The stock of any good in 
store at the beginning of a given period constitutes that quantity which 
was produced in preceding periods but not consumed. It is a capital 
good (although not necessarily a producer’s good, of course). It is 
analogous in many respects with other kinds of physical assets, both as 
it relates to the individual holder and in relation to the economy as a 
whole. A transfer of inventory between two individual firms is an act 
of investment for the buyer and one of disinvestment for the seller. For 
the system as a whole, an increase in the total amount of goods held in 
inventory by all firms constitutes a positive investment, and a cor¬ 
responding diminution of total stocks of goods represents a consump¬ 
tion of capital or disinvestment. This is a point to which we shall need 
to return later. 

MEAStTBEiMBNT CONSIDBBED WITH BEFBBENCH TO FUBFOSH 

The breadth or narrowness with which inventories are defined de¬ 
pends upon the purpose in view. Moreover, the purpose of the investi¬ 
gation also determines whether stocks will be measured in terms of 
phytical quantities or dollar values, and it influences decitions regard¬ 
ing the exact manner in which any necessary combining of individual 
series will be accomplished. It was pointed out above that in customary 
usage the emphasis in discustions of inventories is ordinarily upon 
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so-called “business inventories,” that is, the holdings of manufacturing 
plants and wholesale and retail distributors. There are several respect¬ 
able reasons in partial justification for this emphasis. For one thing, 
current statistical data are more readily available for these groups than 
for some others, although this circumstance is adventitious rather than 
inherent. A second reason is that the stocks of some other groups which 
are important quantitatively—such as those of family consumers or of 
public utility companies—are not being held for sale to others and hence 
do not “hang over the market,” in the same sense as the stocks of manu¬ 
facturers or retailers. For some purposes, it may be permissible to ex¬ 
clude such holdings by groups other than manufacturers and dis¬ 
tributors from one’s definition of total stocks; or, if they are not so 
explicitly excluded, to accept with complacency the present unavail¬ 
ability of the necessary current statistics. Yet for other purposes—and 
these include the task of analyzing at any time the current business 
situation—^the results wiU be only partial and probably will be mislead¬ 
ing if stocks in all positions are not or cannot be adequately considered. 
Although it is true that some stocks are not being held for sale and 
hence do not manifest themselves prominently in direct transactions, 
such stocks nevertheless do “hang over the market” in a very real 
sense. The size of these holdings and changes in their magnitudes are 
important determinants governing the behavior of the holders, influ¬ 
encing not only the strength of their demand for the commodity in 
question but also conditioning their productive efforts. Manufacturing 
and trade inventories are undoubtedly large, and attention had been 
concentrated on them even before Adam Smith used the term “the 
profits of stock” in describing the activities of the 18th century mer¬ 
chant capitalists. But stocks of unconsumed goods are widely held in 
many sectors of the economy. An addition to or a withdrawal from 
stoc]^ represents the amount by which the consumption of a given 
good differs from the amount produced during the period. There can 
be no satisfactory measurement of consumption unless adequate in¬ 
formation can be developed concerning movements in stocks held in 
all positions within whatever sector of the economy is under examina¬ 
tion. 

For some purposes a useful, though seldom used, compilation of 
inventory data can be made on a commodity ba^, tracing the stock 
holdings of a given commodity or group of commodities through all 
stages of the productive, distributive, and consumption proce^. This 
type of presentation has applications both in the study of pioticular 
industries or commodity groups and in describing the inventory posi- 
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CHART 1, 

COMPARATIVE MOVEMENTS IN PHYSICAL VOLUME OP MANUFACTURERS’ AND 
WHOLESALERS* STOCKS OP SELECTED MANUFACTURED FOODS 
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tion of larger sections of the economy. There are instances in which, 
for the purpose in hand, it does not matter appreciably at what stage 
in the economic process stocks of specified finished goods are held, and 
all such finished stocks can be considered as a total, or at least consid¬ 
ered together. This might be the case, for example, if the problem were 
to ascertain, for certain consumer goods in short supply, what quantity 
could be available for consumption under conditions of strictly-exer¬ 
cised government controls. Under other circumstances, however, the 
manner in which a given total amount of inventory were divided 
between manufacturers, retailers, and other holders might make a sub¬ 
stantial difference on the price developments to be anticipated or on the 
measures required to make a given amount of the commodity available 
for export. Also, it is scarcely necessary to say that at times when major 
movements are taking place in prices and the volume of business, the 
manner in which inventories are divided among holders will help to 
determine which groups in the economic structure will reap profits or 
sustain losses. 

Chart 1 gives specific illustration to this fact that inventory holdings 
of the same commodity in different positions may differ considerably, 
both as to magnitude at any one date and as to movements over time. 
Stocks in the hands of two groups—^manufacturers and wholesale dis¬ 
tributors (including chain store warehouses)—^are presented for three 
different manufactured food items: evaporated milk, canned peaches, 
and total caimed fruits and vegetables. The series charted are index 
numbers representing the physical volume of end-of-month or end-of- 
quarter stocks during the period 1942-1947, with the 1943 average for 
each series taken as 100. The several series exhibit different relation¬ 
ships not only in the comparative month-to-month movements of the 
two groups of holdings but also in their current levels relative to the 
base year. Since no current data are available on the stocks of these 
commodities held by retailers, family households, or exporters, the 
chart docs not give a complete account of total stocks in all hands. It 
does suggest the necessity of caution in interpreting changes in the 
quantities held in any one position or in drawing inferences from changes 
in the ratio of stocks to production (or sales) at any one stage. 

Ainother method of classifying or grouping inventories the meaning 
of which needs to be closely examined with reference to any given pur¬ 
pose is the segregation of stocks according to “stage of fabrication.” 
This is the familiar three-fold classification of raw materials, goods in 
process, and finished goods. This method of division is a valid one when 
confined to the various holdings of a particular firm, since such a segre¬ 
gation can be made unambiguous (or virtually so) and since it serves 
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the operational needs of the management. The same method may some¬ 
times be applied to an entire industry, although greater care must be 
exercised. But if we should desire to classify the total inventories of all 
manufacturing industry according to stage of fabrication, it is doubtful 
if our purpose would be accomplished by adding up in one column all 
the goods classed as “finished” by their producers, in another column 
all the goods individually reported as “raw materials,” etc. Such a proc¬ 
ess would have the anomalous result of including in the tally of the 
nation’s stocks of finished goods a substantial quantity of wood pulp, 
sulfuric acid, pig iron, cotton yam, and other very primary products 
which, from the standpoint of the economy as a whole, are scarcely 
finished. It would also place another large portion of the wood pulp, 
pig iron, and similar stocks in the raw materials category, since they 
are so regarded by the paper and rayon manufacturers and steel mills 
which hold them for further processing. An alternative approach to the 
question of measuring the degree of fabrication of manufacturing stocks 
which may prove more fruitful, though admittedly more diflSicult, is 
that of classifying inventories (measured in either physical or value 
terms) according to whether they are entirely finished products, inter¬ 
mediate goods, or primary materials with reference to the manufactur¬ 
ing process as a whole. 

INVENTORY MEASUREMENT PROBLEMS 

We turn now to a more detailed discussion of the measurement prob¬ 
lems encountered in dealing with inventories, with special reference to 
the task of reckoning physical volume. It will be apparent from what 
was said earlier in analyzing the nature of inventories that they are 
additive in character, and to a gmater degree than some other ecofsomic 
variables, such as the value of product. This characteristic derived >rom 
the fact that inventories are pertinent to a point in time rather thah to 
a period of time. A stock of goods is a fund concept rather than a flow 
concept; it is a quantity rather than a rate. Since the same stock cannot 
be found in more than one point on the same date, there are no problems 
of overlapping or duplication (apart from errors in collecting data) 
encountered in adding up inventories. This is not to suggest, of course, 
that economic variables which embody flow concepts (i.e., so much 
quantity per unit of time) cannot be added or that rates cannot be 
averaged. The problems are greater, however, due to the familiar fact 
that during any given period a good may pass through more than one 
manufacturing or distribution stage, and at the end of each stage it 
will appear as a “product.” 

Despite this fact that inventories do possess the quality of being addi- 
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tive conceptually, there are nevertheless important problems to be 
faced in any actual process of summation. These difiSculties vary from 
one situation to another, depending upon the kind of commodity groups 
involved, upon the extent of the economic area or areas over which com¬ 
bined measurement is sought, and upon the kind of terms (value or 
quantity) in which the primary data are expressed. Some of the more 
significant issues can be divided into two broad groups, those arising 
in the process of dealing mth value figures and those arising in the use 
of quantity figures. Both the physical-volume approach and the dollar- 
value approach to inventory measurement are large subjects, and in a 
paper on physical inventory measurement our discussion of value data 
must be confined largely to the usability of such data in deriving indi¬ 
cators of quantity. First, however, I should like to allude briefly to 
certain features characteristic of inventory value figures in general 
which compromise their meaningfulness when aggregated, whether in 
their original form or in some deflated form. 

A fundamental consideration is that figures on inventory values are 
creatures of corporate cost accounting. As such, they can probably be 
regarded as having been designed to fit the operational needs of the 
management—or at least as conforming to whatever practices have 
grown up and become habitual in the company. But there are several 
limitations attaching to the collecting and adding together for purposes 
of economic analysis of inventory figures which represent accounting 
book records developed for such different purposes by firms utilizing 
heterogeneous accounting methods. I have reference here principally to 
the limitations arising out of the great lack of uniformity among com¬ 
panies in the accounting treatment accorded various types of inventory. 
Thes characteristics of the original data complicate and equivocate 
any i ggregate measures of the total value of inventories. 

Bat a much more diflBicult task of measurement and interpretation 
is encountered if one attempts to employ any given series comprising 
the total dollar value of inventories as source material from which to 
derive an indication of movements in the physical quantity of stocks. 
It is apparent that inventory value figures, like any other value aggre¬ 
gates, are compounds of two types of element—a quantity element and 
a valuation or pecuniary element—^and that changes in the value of 
inventories may be brought about by changes in either component. 
Because of the fact that the value of inventories does embody a pe¬ 
cuniary element and is known to respond, to some degree and with 
more or less flexibility, to changes in the prices at which goods are ex¬ 
changed in the market, it is often thought that inventories reported in 
value terms should be deflated by some generally-known price index in 
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an endeavor to obtain a measure of quantity changes apart from the 
influence of price movements. But the pecuniary elements which are 
embedded in accounting records of inventory value are not comparable 
with available price index series. In fact, these pecuniary elements do 
not constitute price data at all. Consequently, an attempted physical- 
volume determination through a deflation of an inventory value index 
confronts the obstacles (a) that corporate inventory records usually 
do not carry stocks of finished goods at current market selling prices 
but at one or another of several different cost figures having various 
time references; (b) that a significant portion of total inventories in 
some sectors of the economy represents intermediate products or goods 
in process which seldom if ever enter the market; (c) that many firms 
follow a practice of making certain adjustments in one or more of their 
inventory figures at the end of an accounting period to reflect changes in 
replacement costs; (d) that it is diflGlcult to achieve comparable com¬ 
modity coverage, even for finished goods, between the price index and 
the inventory value index; and (e) that, even when the commodities 
covered conform closely, the weighting systems employed in available 
price indexes take account of relative importance of various goods sold 
rather than relative importance of different goods in inventory. Because 
of these difl&culties one could hardly expect a process of deflating in¬ 
ventory value data to 3 deld a satisfactory measure of movements in the 
physical magnitude of stocks. 

In a recent investigation of physical stocks in the hands of food 
manufacturers,^ the deflation procedure was also tried to see how a se¬ 
ries showing the value of inventory in terms of constant prices would 
compare, in this field, with an index of the physical quantity of stocks 
measured directly. The differences in behavior of the two types of index 
were marked. The deflated value series showed much less regularity of 
seasonal movement mthin itself than did the physical index; the short¬ 
term movements of the two series did not correspond closely; and there 
were significant differences in the postwar levels of the two indexes 
relative to the base period. Although the dissimilarity was partly at¬ 
tributable to differences in coverage and differences between the weight¬ 
ing system employed in the price deflator and that of the inventory 
series, a substantial part of the dissimilarity was believed explainable 
by the difilculties inherent in attempting to relate the cost-accounting 
valuations represented in inventory book-value data to changes in 
market prices. 

1 Desenbed in an unpublished manuscript, which has been circulated within the Federal Reserve 
System in mimeographed form, "Changes in Physical Stocks Held by Food Manufacturers” (December 
1947). 
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DIRECT MEASUREMENT OP PHYSICAL INVENTORIES 

In view of the dubious meaning of figures arrived at through a defla¬ 
tion of inventory value series, it seems desirable to approach the task 
of ascertaining the physical volume of inventories directly through the 
use of data expressed in quantity terms. This is no novel idea, of course. 
Physical inventory statistics undoubtedly would be more widely em¬ 
ployed if they were more readily available. Some of the problems en¬ 
countered in developing such measures should be described. 

First, as to individual series. Current series on inventories in quan¬ 
tity terms are regularly collected for numerous commodities. There is 
cause for gratitude in the fact that the number of individual commodi¬ 
ties covered by current series has slowly increased. Most of the series 
emanate from government agencies or trade associations, and ordinar¬ 
ily they relate to items which are the major products of or principal 
materials consumed in a given industry or trade group. The available 
stocks series ordinarily cover a very high proportion or the entire 
quantity of such stocks held in particular positions. 

After these encouraging facts have been enumerated, it still must be 
ackjnowledged that the series presen,tly available on commodity stocks 
are inadequate in several respects. (1) Although there are many stocks 
series on manufacturing industries, a few for agriculture and whole¬ 
saling, and a scattering for other groups such as importers and Federal 
government agencies, the coverage of industries is by no means uni¬ 
form. Even within the manufacturing sector of the economy, where 
coverage is greatest, there are wide differences in the amount of physi¬ 
cal stocks data available. If we consider the manufacturing field ac¬ 
cording to Census industry groups, we find reasonably satisfactory 
current data for foods, textile-mill products, paper and pulp, rubber, 
petroleum and coal products, and the stone, clay, and glass group. On 
the other hand, there arc no continuous stock scries whatsoever for 
the motor-vehicle, transportation equipment, or furniture groups and 
virtually none for machinery. The remaining industries fall somewhere 
between these extremes. (2) A second point of inadequacy is that cover¬ 
age is not uniform as to type of commodity. More series are available 
on raw materials or on finished products of a primary sort than on 
highly advanced or diversified finished products. There are very few 
series on stock holdings of wrapping and packaging materials or of fuel 
held by consuming plants. Fortunately, the proportion of total inven¬ 
tories left uncovered by these circumstances is smaller than the abso¬ 
lute number. (3) Outside of the manufacturing sector, series on physical 
stocks are even less numerous. In general, data on physical stocks are 
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fewer than inventory value data in almost every major sector of the 
economy. The comparison is especially unfavorable in retail trade. 
Exceptions to this generalization are found in agriculture and in a few 
other cases. It is noteworthy that there have even been some attempts 
to gather information on consumer stocks, though there are few con¬ 
tinuous series in this important field. 

Apart from the question of obtaining individual series for particular 
commodities, there are the further problems of how individual series 
may be combined and how inventory movements in broad areas of the 
economy may be represented. Some tentative recommendations on 
these matters can be advanced, based on the investigation we have 
made of stocks in the hands of food manufacturers, although methods 
employed in that field might require more or less alteration if applied 
elsewhere. First of all, where the series to be combined refer to many 
heterogeneous commodities, group measures cannot be obtained by 
direct summation, and any index number will have to be an average 
of relatives. In most cases, some system of explicit weights will appear 
desirable, in order to accord greater influence in the total index to move¬ 
ments in stocks of major importance than to movements in those of 
minor importance. In constructing the index of stocks held by food 
manufacturers, it was decided that the best criterion of “importance’^ 
for this purpose was provided by the value of inventory of the respec¬ 
tive commodities in 1939 as recorded by the Census of Manufactures. 
The use of weights derived from inventory value data means that inso¬ 
far as the “q” elements embodied in the value figures correspond with 
the “q’s” in the series themselves, we are in effect weighting with 1939 
prices. In making use of Census of Manufactures data for deriving 
weights, two other technical problems arise which can only be men¬ 
tioned in passing. One is the task of deriving from Census inventory 
value figures collected on an industry basis estimates of inventory val¬ 
ues for particular commodities within the industry, and the other is 
the adjustment required by the fact that the Census figures represent 
year-end values rather than averages for the year. 

An important fact to be observed is that in measuring stocks by this 
method many items are left “unrepresented,” i.e., any representation 
is only by the index as a whole and not by parts. The decision to use 
this method is dictated by the difficulty of inferring movements in the 
inventory of one commodity from movements in stocks of some other 
commodity for which data are available. In measuring production, 
there are certain well-known relationships which can be relied upon 
with more or less security in statistical measurement. For example. 
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certain goods are made as “joint products” of the same manufacturing 
process. If a series of figures is available on the slaughter of cattle, for 
instance, it can be made to serve as a measure not only of packing-house 
production of beef but also of cattle hide production. Sometimes the 
consumption of materials is used to represent the activity involved in 
processing these materials into finished products. There is also a rela¬ 
tionship, though of a different sort, between the output of a raw mater¬ 
ial and that of a finished product in which it is a major ingredient. But 
relationships between the stocks of two different goods are much less 
predictable than output relationships. Even in the instance of jointly- 
produced goods like beef and hides, there can be no assurance that if the 
packer’s stocks of hides are growing his stocks of beef are likewise in¬ 
creasing. Demand factors may be such as to move stocks of the joint 
product in the opposite direction or to leave them unchanged. There 
are analogous difficulties in dealing with raw material stocks. 

If one of the available series included in an index of stocks were to 
be accepted as measuring not only the fluctuations of that one com¬ 
modity but also variations in some other material or product, the im¬ 
portance of the unrepresented items could be assigned, by means of 
“imputed weights,” to the included series. But the decision to impute 
weights in this fashion would involve the assumption that fluctuations 
m an available series would faithfully reflect movements in the unrep¬ 
resented stocks whose weights it carried. In food manufacturing, the 
available statistical material has not 3 delded empirically any reliable 
relationships on the strength of which a system of indirect representa¬ 
tion could be justified; moreover, there ore reasons a priori, as mentioned 
in the paragraph above, for believing that in some cases, at least, no 
reliable relationship exists. 

For those reasons, it appeared best practice in computing the index 
of food manufacturers’ stocks to let any given series represent only the 
stocks of the commodities covered directly; the weights of other items 
were not imputed to the series of represented items. When this method 
is followed, the selection of items for inclusion in an index is based on 
their measurability and the resulting index is representative of move¬ 
ments not of all the relevant stocks but only of those for which quantity 
data are included in the index. In the case of the food index, further 
investigation is required to establish the validity of using this index as 
a measure of total food stocks in manufacturing. However, the infer¬ 
ence should not be drawn that these uncovered items would actually 
have been “represented”—^in the sense that the index would be more 
faithfully reflective of movements of all food inventories—^if wmghts 
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attributed to them had been assigned to some one or more series of 
represented stocks. The imputation of weights, the process of nominat¬ 
ing one statistical series to double or act as proxy for another, has many 
legitimate uses, but it is invalid unless the relationship between the 
two is so well established that a substitute can be relied upon to behave 
in a manner similar to, or predictably different from, that of the absent 
member. Although this has not been found to be the case in the work 
here described, there is no implication that the same lack of consistent 
relationship would necessarily be found in stock data in other com¬ 
modity fields. 

CONCLXTSIONS 

Any conclusions to which this examination may lead should be re¬ 
garded as tentative. However, I should like, in review, to emphadze 
certain points. Just as there can be no single, all-purpose index of prices, 
there is no single measure of inventories—^nor even a single method of 
measurement—^which can serve all purposes. We need to be critical in 
scrutinizdng any index or other compilation of inventory data to make 
sure of its meaning or relevance to tha problem in hand. Both the dollar- 
value approach and the physical-volume approach have an important 
place in economic analysis; but they do not always have the same 
place. Speaking generally, inventory value figures are more appropriate 
for use in conjunction with corporate financial-statement data and with 
some other types of data expressed in dollar values. Pineal stocks 
figures are more appropriate for use in connection with production 
measures and certain other data reckoned in quantity terms. Because 
of fimdamental differences in their nature, data on book values often 
cannot be made to yield fully satisfactory indications of changes in the 
phyacal quantities of stocks. Physical stocks held in some parts of the 
economy can be directly measured with a fair degree of reliability from 
present data, but before indexes representative of very broad areas of 
the economy can be constructed there must be a substantial increase 
in the number and coverage of basic series. Since the concept of inven¬ 
tory is considerably broader, in my opinion, than current usage would 
indicate, there is a corresponding need to extend the investigation of 
stocks to commodities and to economic groups—^notably family house¬ 
holds—thitherto largely ignored. 

The further study of inventories should also be extended in two other 
directions. Detailed study is needed of the relationships between the 
size of inventories in a particular industry and its rate of activity. In¬ 
vestigation to date seems to point to several preliminary conclusions. 
(1) The ratio of stocks to sales (or production) is not constant at all 
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levels of activity. (2) DiiBferent classes of inventory within the same 
industry exhibit different movements in relation to activity. Often 
there is a fairly close association between the amount of goods in proc¬ 
ess and the rate of operations; but there are many possible relation¬ 
ships between changes in activity and movements in stocks of mate¬ 
rials or finished goods. (3) The relationship between inventory holdings 
and sales is influenced by the general state of activity as well as by the 
level of activity in the industry in question. Stocks are less likely to 
rise pari passu with increases in activity when the economy is operating 
close to full employment than at lower levels. (4) The degree of corre¬ 
spondence depends not only upon the level from which a given move¬ 
ment in activity or change in stocks commences but also upon its rapid¬ 
ity. (Some of these relationships can be seen by a careful reading of 
Chart 2, which shows comparative monthly movements of production 
and stocks of an important group of foods during the period since 1935. 
The correspondence between the growth of output and of inventories 
of these goods which had characterized most of the period through 1940 
disappeared after that time, and the high levels of food production 
achieved during the war and subsequent to it were accomplished with¬ 
out a commensurate rise in stocks held at the manufacturing level.) 

Another matter requiring further study is the effect on the economy 
as a whole of various amounts of inventory change. In the imaginary 
static society earlier referred to, the levels of inventories were constant; 
there was never a so-called “net change in inventories”; or, rather the 
net change was always zero. In the actual world, although large net 
changes in inventories are ordinarily associated with periods of signifi¬ 
cant shifts in the general economic situation, it would be dangerous to 
treat a zero net change in inventories as necessarily signifying the exist¬ 
ence of stable conditions. The relationships between the general level 
of activity and the amount of inventory change need to be investigated 
in terms of gross changes as well as net. 



ON THE DETERMINATION OF SAMPLE SIZES 
IN DESIGNING EXPERIMENTS* 


MABII4YN HABBiSyf D. G. Horvitz, and a. M. Mood 
Iowa State College 

Methods are developed for determimng sample sizes re¬ 
quired to estimate means of normal populations with given 
precision, or to give significance when means differ by a speci¬ 
fied amount. The methods assume that an estimate of the vari¬ 
ance has been obtained from a previous or preliminary experi¬ 
ment. A simple device is suggested for using a priori informa¬ 
tion about the variance when a formal estimate is not avail¬ 
able. 


1 . INTRODUCTION 

S tatisticians frequently encounter the problem of determining the 
sample size required to obtain a given result, for example, to esti¬ 
mate a mean by a confidence interval no longer than a specified amoimt. 
or to show a significant difference between two means if the difference 
is greater than a specified amount. The problem cannot be solved if 
nothing is known of the population distribution, but this is a rather 
unusual situation. Experimental workers generally know, for example, 
whether their populations are sensibly normal or not, and if not, what 
transformations will make them so. But this knowledge (of the form of 
the population distribution) is still not enough to make the problem 
determinate. It is necessary also to know something about the order of 
magnitude of the standard deviation. 

The experimental worker ordinarily has information of greater or 
less precision about the magnitude of the variance. He may have a 
rather precise estimate based on past experiments with the same popu¬ 
lation, or he may have only a guess. Usually his information is between 
these extremes, being based on past experiments with similar but not 
identical populations. We are concerned with finding a method for using 
information of this kind in planning an experimental investigation. 
However, we shall first consider the more special situation in which 
there exists an estimate of the variance with known precision. Then in 
the final section of the paper we shall return to the more common situa¬ 
tion in which the estimate of the variance is of dubious precision. The 
technique described there is admittedly crude, but it does provide a 
method for dealing with an important practical problem. 

* Most of this work was done under oontraet N7oiir371 with the Office of Naval Besearch. 
t Now at Hunter OoUege. 
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In the following sections we shall deal with the problem of determin¬ 
ing sample sizes when: 

1. The population is normal, and 

2. An estimate of the variance is available. 

Thus we assume at the outset that there is definite information about 
the variance in the form of an estimate obtained from a previous experi¬ 
ment with the same population. The methods to be presented are ap¬ 
propriate when an experiment is to be repeated once but there is no 
thou^t of successive repetition as is the case, for example, in quality 
control work. We are concerned here with the circumstance in which a 
research worker has turned up a result which, thou^ not statistically 
significant, would be important if it were reproduced in a more defini¬ 
tive experiment. Thus a person trying various combinations of materi¬ 
als in an attempt to increase the life of a certain manufactured product 
may find one which gives an estimated mean life much greater than 
that of the standard product. The confidence interval for the new mean 
however, is too broad to give positive evidence of the superiority of the 
new combination of materials, and it is desired to estimate the mean 
with greater precision in a second experiment, or to test whether the 
mean for the new combination exceeds that for the standard by a spec¬ 
ified amount in a second experiment. 

2. SAMPLE SIZE FOB A SPECIFIED CONFIDENCE INTERVAL 

Let US suppose that an experimenter wishes to estimate the mean of a 
normal population with a .99 confidence interval no longer than four 
units, and that he has an estimate, $ 1 ^= 9 , of the population variance 
with m =8 degrees of freedom. Let us further suppose that he can be 
induced to state his problem in the following manner: 

I have an estimate, nine, of the variance of a normal popula¬ 
tion with eight degrees of freedom. I wish to draw a sample 
sufficiently laige that the probability will be .95 that the 
.99 confidence interval will have length less than four units. 

It is impossible to guarantee that the confidence interval will be no 
longer than a specified amount, but one can guarantee it with prob¬ 
ability .95 (or any other given probability). 

This problem is readily solved with existing tables. Let «-|-l be the 
required sample size so that the second estimate 82 ^ of the population 
variance will have n degrees of freedom. The general relation which 
determines n (see appendix) is: 

(d/siY = m)/(n - 1 - 1 ) ( 1 ) 

where the symbols have these meanings: a is the confidence coefficient 
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of the confidence interval, p is the probability that the specified length 
will not be exceeded, d is half the specified maximum length of the 
interval, h-ain) is the 1—« critical level of Student’s t for n degrees 
of freedom, and m) is the 1—j8 critical level of the variance 

ratio for n and m degrees of freedom. 

The relation cannot be solved explicitly for n. Probably as good a 
method as any for finding n is to determine it by trial and error. One 
might, for example, compute a value V for the ri^t side of (1) using a 
trial value of «. K P turns out to be larger than (d/si)*, a larger value of 
n would be used in the next trial; otherwise a smaller value of n would 
be tried. In the specific example above where Si=3, m=8, a=.99, 
P=.95, d=2, we may try n=40, for example, to get 

V = (7.31)(3.05)/41 = .54, 

which is lai^er than (d/si)^=-444, hence a larger value of n is needed. 
After a few trials n is found to be 48 so that the required sample size 
is 49. 

Critical values of the variance ratio for =.95 and .99 are tabulated 
in almost every statistics book; for other values of j3 one may consult 
references 1 and 2 at the end of the paper. The value of in (1) is con¬ 
veniently tabulated in the F tables since Fi_„(l, to) =<?_«(«)• 

We may observe that the result, «+l=49, is considerably larger 
than what one would have obtained by employing this commonly 
used argument: The population standard deviation is about 3, hence 
the mean of a sample of size n-{-l will have a standard error of 
3/Vn-|-l. For a 99% confidence interval t.u will be 2.8 or 2.9, hen ce the 
half confidence interval will have a length of about (2.9)(3)/V'n+l. 
On putting this equal to two, one finds n-fl = 19. This argument es¬ 
sentially puts F=1 in equation (1) and corresponds roughly to putting 
i8=0.5; that is, with n-l-l = 19 one has about a 50-50 chance that the 
resulting confidence interval will have a length smaller than four. 

3 TESTS OF SIGNIFICANCE OF THE MEAN 

The problem of testing the significance of a mean is closely related 
to the problem of confidence interval estimation of the mean. Let us 
suppose the particular problem considered in section two is changed 
to read as follows: 

I have an estimate, $1^=9, of the population variance with 
m=8 degrees of freedom. I wi^ to draw a sample large 
enough that the probability will be j8=.80 that I will get 
significance at the 7=.05 level in testing whether the mean 
exceeds 10 if it actually exceeds 10 by as much as a=1.5. 
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TABLE I 

VALUES OF h -a/n FOB |3» SO 

V- 06 FOB ONE-TAILED TESTS, .10 FOB TWO-TAILED TESTS 


X 

1 

2 

3 

4 

5 

6 

8 

12 

16 

24 

32 

- 

1 

13.8 

8.52 

7.39 

6.03 

6.68 

6.51 

6.31 

6.13 

6.04 

5.06 

5.92 

5.70 

2 

5.88 

3.51 

3.02 

2.81 

2.70 

2.62 

2.53 

2.45 

2.41 

2.37 

2.35 

2 30 

3 

4.30 

2.55 

2.20 

2.03 

1.96 

1.91 

1.85 

1.78 

1.75 

1.72 

1.70 

1.65 

4 

3.55 

2.10 

1.80 

1.67 

1.60 

1.56 

1.60 

1.45 

1.43 

1.40 

1.30 

1.36 

5 

3.12 

1.85 

1.58 

1.47 

1.41 

1.37 

1.32 

1.28 

1.25 

1.23 

1.22 

1.18 

6 

2.81 

1.66 

1.43 

1.32 

1.27 

1.23 

1.19 

1.15 

1.13 

1.11 

1.10 

1.07 

7 

2.56 

1.52 

1.30 

1.21 

1.16 

1.12 

1.08 

1.05 

1.03 

1.02 

1.01 

.979 

8 

2.37 

1.41 

1.21 

1.12 

1.07 

1.04 

1.00 

.972 

.056 

.940 

.032 

.910 

9 

2.23 

1.32 

1.14 

1.05 

1.01 

.078 

.944 

.013 

.808 

.883 

.875 

.854 

10 

2.11 

1.25 

1.07 

.903 

.952 

.925 

.893 

.863 

.840 

.835 

.828 

.805 

12 

1.02 

1.14 

.075 

.902 

.865 

.840 

.811 

.784 

.771 

.758 

.752 

.732 

14 

1.77 

1 05 

.800 

.831 

.797 

.775 

.748 

.723 

.710 

.600 

.693 

.676 

16 

1.65 

.076 

.838 

.775 

.743 

.722 

.697 

.673 

.662 

.651 

.646 

.631 

18 

1.56 

.021 

.790 

.731 

.701 

.681 

.658 

.635 

.624 

.614 

.609 

.594 

20 

1.48 

.873 

.760 

.603 

.665 

.646 

.624 

.602 

.502 

.583 

.578 

.563 

25 

1.32 

.770 

.669 

.619 

.593 

.577 

.557 

.538 

.529 

.520 

.515 

.502 

30 

1.20 

.708 

.608 

.663 

.540 

.625 

.507 

.489 

.481 

.473 

.469 

.456 

40 

1.04 

.613 

.626 

.486 

.467 

.454 

.438 

.423 

.416 

.400 

.405 

.305 

50 

.926 

.548 

.471 

.436 

.417 

.405 

.391 

.378 

871 

.365 

.362 

.353 

60 

.844 

.490 

.429 

.306 

.380 

.369 

.356 

.344 

.338 

.333 

.330 

.322 

80 

.730 

.432 

.371 

.342 

.328 

.319 

.308 

.208 

.202 

.288 

.285 

.278 

100 

.652 

.385 

.331 

.306 

.293 

.285 

.275 

.266 

.261 

.257 

.255 

.249 


The size of the specified mean is not relevant to the problem; the criti¬ 
cal figure is a=1.5. In order that the sample size be determined it is 
necessary that the experimenter come to a decision about the magni¬ 
tudes of jS, 7 , and a. 

Some new tables are required to solve this problem, one for each com¬ 
bination of values of 7 and p. Two are presented here—one for 7 =.05, 
l8=.80, and one for 7=.05, P—.95. The theoretical basis of the tables 
is given in the appendix. The number of degrees of freedom n in the 
second estimate s*® of the variance is found by computing the number 

k = a/si ( 2 ) 

then locating that value in the column headed by m; the number op¬ 
posite k in the column on the left is the required value of n. In the ex¬ 
ample given above, fc=1.6/3=0.5. This value is foimd in the column 
headed by 8 in Table I to correspond to n=31, so that the sample 
size would be 32. 

Interpolation in these tables is nearly linear on the reciprocal of m 
and is also nearly linear on the reciprocal of the square root of n. 
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tabus u 


VALUES OF i!-a/«t FOR fi-JdS 

7-.06 FOB ONE-TAILED TESTS, .10 FOB TWO-TAILED TESTS 


X 

1 

2 

3 

4 

5 

6 

8 

12 

16 

24 

32 

00 

1 

57.1 

19.5 

14.4 

12.6 

11.6 

11.0 

10.4 

9.85 

9.58 

9.33 

9.21 

8.86 

2 

24.2 

7.74 

5.60 

4.77 

4.39 

4.16 

3.86 

3.61 

3.49 

3.38 

3.33 

3.19 

3 

17.6 

6.58 

4.03 

3.39 

3.13 

2.94 

2.74 

2.55 

2.46 

2.39 

2.35 

2.23 

4 

14.5 

4.58 

3.28 

2.79 

2.56 

2.40 

2.23 

2.08 

2.01 

1.94 

1.91 

1.82 

6 

12.6 

3.97 

2.88 

2.41 

2.23 

2.09 

1.93 

1.82 

1.76 

1.69 

1.66 

1.58 

6 

11.2 

3.55 

2.57 

2.17 

2.00 

1.88 

1.73 

1.62 

1.57 

1.52 

1.49 

1.42 

7 

10.3 

3.26 

2.36 

1.99 

1.83 

1.72 

1.58 

1.48 

1.43 

1.38 

1.36 

1.30 

, 8 

9.70 

3.05 

2.19 

1.86 

1.70 

1.60 

1.48 

1.39 

1.34 

1.29 

1.27 

1.21 

9 

9.12 

2.87 

2.06 

1.75 

1.60 

1.50 

1.39 

1.30 

1.26 

1.21 

1.19 

1.13 

10 

8.62 

2.72 

1.95 

1.65 

1.51 

1.42 

1.32 

1.23 

1.19 

1.15 

1.13 

1.07 

12 

7.83 

2.47 

1.77 

1.50 

1.37 

1.29 

1.20 

1.12 

1.08 

1.04 

1.02 

.971 

14 

7.22 

2.28 

1.63 

1.38 

1.26 

1.19 

1.11 

1.03 

.993 

.959 

.942 

.893 

16 

6.73 

2.13 

1.52 

1.29 

1.18 

l.U 

1.03 

.959 

.924 

.893 

.878 

.834 

18 

6.35 

2.01 

1.44 

1.22 

1.11 

1.04 

.972 

.904 

.872 

.842 

.828 

.785 

20 

6.02 

1.90 

1.36 

1.15 

1.05 

.991 

.921 

.858 

.827 

.798 

.785 

.744 

25 

5.37 

1.70 

1.22 

1.03 

.940 

.884 

.822 

.765 

.738 

.712 

.700 

.663 

30 

4.89 

1.54 

1.11 

.935 

.855 

.804 

.748 

.695 

.671 

.647 

.636 

.605 

40 

4.23 

1.33 

.962 

.809 

.739 

.696 

.646 

.601 

.580 

.560 

.550 

.525 

50 

3.78 

1.19 

.854 

.722 

.661 

.622 

.577 

.637 

.518 

.500 

.492 

.469 

60 

3.45 

1.09 

.778 

.658 

.602 

.567 

.525 

.490 

.472 

.456 

.448 

.428 

80 

2.98 

.940 

.672 

.569 

.520 

.490 

.454 

.423 

.408 

.395 

.388 

.369 

100 

2.67 

.840 

.600 

.508 

.465 

.438 

.405 

.378 

.365 

.353 

.347 

.329 


When the computed value of ib is smaller than the final entiy in the col¬ 
umn of the table, the sample size will be larger than 100 and may be 
computed as follows: 

n = 100(X/A!)», (3) 

where K is the tabular value for n=100 in the column headed by m. 
In the illustrative example, if a were specified to be 0.6 instead of 1.5, 
the value of k would be 0.2 and n would be foimd to be 

n = 100(.275/0.2)* = 189. 

The example we have considered was stated so that a one-sided test 
of significance was implied. The experimenter was interested only in 
whether the mean exceeded the given figure, ten; he had no interest in 
a smaller mean however small it might have been. He would compute 
Student’s t from his second sample and if it were greater than 
(31)=1.695 he would conclude at the 5% levdi of significance that the 
mean exceeded ten. We have used the subscript .10 because the tables 
for critical levds of t ordinarily include both tails of the distribution m 
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the critical region. The sample size 32 gives him 80% assurance that 

t = {x - 10 )/s 2 (4) 

will exceed 1.695 if in fact the population mean is 11.5 (x is the sample 
mean and sj the sample standard deviation). 

If one wishes to make two-tailed tests, then Tables I and II may be 
used provided the level of significance is doubled. Thus, to test whether 
the mean differs significantly in either direction from 10 in the il¬ 
lustrative example, one could again use the sample size 32 but the test 
would be at the 10 % level of significance. 

Now let us consider the possible errors that may be made in the con¬ 
clusions in relation to |3 and y which are .80 and .10 respectively in the 
problem at hand. If the population mean is ten, then the sample may 
give rise to a value of t lai^er than 1.695 with probability .05. The total 
probability of obtaining significance in either direction (the type I 
error of Neymanand Pearson) is . 10 . If the population mean is 8.5, the 
sample may give rise to a value of t which is not smaller than —1.695, 
and one will fail to detect at the .10 level of significance that the mean 
is smaller than ten; the probability of this error (the t 3 rpe II error of 
Neyman and Pearson) is 1 —j 8=.20 in the present instance. A similar 
situation occurs when the mean is actually 11.5; the probability is .20 
that the sample will fail to show the mean to be significantly larger 
than ten at the .10 level of significance. When a type II error is made it 
will usually consist of the statement that the mean does not differ 
from ten. However there is a remote chance that an extreme error will 
be made—^that the population mean will be declared smaller than ten 
when m fact it is 11.5, or declared larger than 10 when it is actually 
8.5. The probability of such an extreme error is about .0006 for Table 
I and about .0002 for Table 11. Thus in the given problem, if the true 
mean is 11.5 and one uses a sample of size 32 to test whether the mean 
differs from ten at the .10 level, the probability is about .0006 that the 
sample will give rise to a value of t which is smaller than —1.695, and 
thus lead one to the conclusion that the population mean is less than 
ten. 

4. DIFEERENCES BETWEEN TWO MEANS 

If one wishes to estimate the difference b between two population 
means knowing that the populations are normal and have the same 
variance, he may draw a sample of size n-|-l from’each population and 
put 

(a — X* — 5)\/n + 1 


■s/s2* St* 


( 5 ) 
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equal to ± 2 i-a( 2 n) and solve for h to obtain the limits of the a confi¬ 
dence interval. (£ and S' are the two sample means, and sj* and s's* 
are the two sample variances.) Suppose one wants n +1 to be large 
enough that the probability will be j 8 that the confidence interval will be 
shorter than 2d, and that an estimate with m degrees of freedom has 
been previously obtained for one of the populations. The equation 
which determines n is 

dV2si* = t\-.„{2n)Fi-p(2n, m)/{n + 1) (6) 

as follows directly from equation ( 1 ). The divisor 2si® arises from the 
fact that the variance of the difference between two observations is 
2 o^ if each observation has variance Equation ( 6 ) may be solved by 
trial and error for n. 

In testing whether one mean is laiger than another by an amount a 
one proceeds exactly as in the preceding section except that the quan¬ 
tity h to be found in the table is defined by 

h = o/ 2 si. 

The number n thus determined is the number of degrees of freedom in 
the pooled estimate of the variance; the size of the sample to be drawn 
from each population is therefore (n-|-2)/2 if n is even or (w+3)/2 if 
n is odd. This sample size will insure that a difference in means of size 
a will be detected at the .05 level with probability .80 or .95 according 
to whether Table 1 or Table II is used to determine n. Here again it is 
assumed that the two populations are normalJand have the same 
variance. 

6 REPLICATION OP EXPERIMENTAL DESIGNS 

Tables I and II may be used to aid one in making a decision about the 
number of replications to be used in a designed experiment. Suppose 
p normal populations with means Ci, cs, * • •, Cp and a common vari¬ 
ance are to be sampled; the means may represent treatment effects, for 
example, in an agricultural or industrial experiment. The most satis¬ 
factory way to deal with this problem is to have the experimenter specify 
what size treatment sum of squares he wishes to detect; that is, specify 
the size of 

S (c. - 

However, the solution of the problem in this form requires triple entry 
tables (the three variables being the munbers of degrees of freedom in 
the preliminary estimate of the error variance, the error variance of 
the experiment, and the treatment sum of squares). Since we had not 
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the facilities to compute such tables, we shall have to consider the 
problem merdy in terms of the differences between pairs of means. 

Let us suppose that a number of treatments or treatment combinar 
tions are to be studied in an experimental design with r replications, 
that the number of degrees of freedom in the error variance at* for the 
proposed experiment is n, that any treatment mean will be based on g 
observations, and that an estimate Si* with m degrees of freedom of the 
error variance is available. Two treatments with means and £j will 
be compared by computing 

i = Vg (£• — £/)/V2 $2 

and comparing it with a critical level of i with n degrees of freedom. The 
numbers n and g will be functions of r. 

The size of a difference betwem treatment means that has an 80% 
chance of being detected at the .05 level (in a given direction) may be 
computed for any proposed value of r by means of the relation, 

~ (7) 

where k is the mtry in Table I corresponding to m and n. Similarly, 
using Table II to determine k, one may find the difference which has a 
95% chance of being detected at the .05 level. 

As an illustration, suppose six treatments are to be tested in a ran¬ 
domized block experiment with r replications and that we have an es¬ 
timate, Si®=*9, of the error variance with m=8 degrees of freedom. We 
then have g=r and n=5(r—1). For r=2, for example, a difference in 
treatment means of 

o = Vo (1.32)3 = 9.7 

has an 80% dxance of being detected in a particular comparison. With 
five replications, a is reduced to 

/42 

® “ r 7 

A few trial calculations of this nature for feasible values of r will provide 
one with an objective baste for conung to a decision about the number 
of replications to be used. 

S. USE 07 OUESSED ESTIMATES 07 THE VARIANCE 

It often happens that while an esperimenter has no specific informa¬ 
tion about the variance of a distribution in the form of an estimate 
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Si^ he nevertheless has some more or less reliable opinion of its order of 
magnitude. He may, for example, have done considerable experimenta¬ 
tion with several similar populations and thus have knowledge of the 
general range of variation in such populations. When an experimenter 
has information of this kind he would naturally like to use it, and of 
course he should do so—^it will improve the efficiency of his experi¬ 
ments. 

The question arises as to how one may quantify information of a 
general character; more precisely, how may one convert the informa¬ 
tion into numerical quantities which may be used for m and Si*? One 
simple method is for the experimenter to place what he feels to be 
reasonable upper and lower limits on the standard deviation. We may 
call these numbers 0 and H, and let us suppose that the experimenter 
has selected them in an effort to make it a fair nine-to-one bet that they 
will include the population standard deviation. In other words, G and 
H are to be regarded as 90% confidence limits for the standard devi¬ 
ation. One may then use the average of G and H. for si.: 

Si = 1/2(G + H), 

and determine the degrees of freedom, m, to be associated with Si by 
computing G/H and consulting Table III. The second line of this table 
consists merely of the square roots of the ratios of the .95 and .05 levels 
of chi-square. 

TABLE m 


DEGREES OP FREEDOM ASSOCIATED WITH VARIOtTB RATIOS 
OP LIMITS ON THE STANDARD DEVIATION 


m 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

Q/B 

31.3 

7.6 

wm 


3.1 

2.8 

2.6 

2.4 

2.3 

2.2 

Olh 

3.6 

2.2 

1.84 

■BH 

1.58 

1.51 

1.46 

1.42 

1.39 

1.36 


If one has no firm feeling that his upper and lower limits really cor¬ 
respond to nine-to-one odds, he may prefer to select upper and lower 
limits, g and h, in such a way that he would have no choice between 
sides of an even-money bet that they include the standard deviation. 
Again the average of g and h would be employed as the estimate of 
8i, and m would be determined by the third line of Table III which 
was computed using the .75 and .25 levels of chi-square. This method 
is, in fact, to be preferred over the first method because one may 
ordinarily be expected to have a more accurate conception of a fair 
even-money bet than a fair nme-to-one bet. 
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As an example, let us suppose 7 and 12 have been selected as 50% 
limits on a standard deviation. The estimate St is 9.5, and since the 
ratio g/h is 1.7, there are about 4 degrees of freedom associated with 
the estimate. If one wishes to be conservative he may prefer to call it 
3 degrees of freedom, but ordinarily the nearest figure may as well 
be selected. Only in rare occasions would there be any point in inter¬ 
polating between 3 and 4, though of course there is no reason why 
a fractional value of m could not be used. 


appendix: 

A sample of size »+l is to be drawn so that the probability will be 
P that an a confidence interval for the mean will be no longer than 
2d; an estimate sj? of the variance with m degrees of freedom is avail¬ 
able. If is the sample variance of the ne w sam ple, the half-length of 
the confidence interval will be ti-aMsi/Vn+l; it is desired that n 
be such that 


or 


P[ti-a(n)8i/-\/n -H 1 < d] = iS 


P 



Vn + 1 d"j 

ti-a(n) J 


= /S. 


For any «, it is known that 

P(s2 < m)si) = p 


( 8 ) 

(9) 

( 10 ) 


where Pnj(«, m) is the critical level of the variance ratio for n and m 
degrees of freedom. On equating the right-hand sides of the inequalities 
in (9) and (10) one obtains the relation given in equation (1) for deter¬ 
mining n. 

We now turn to the derivation of Tables I and II. Suppose £ is the 
sample mean and that the null hypothesis =0 is to be tested against 

alternatives m>0 at the y level of significance. Ho would be rejected if 

V» + 1 */S 2 > Uy(,n) (11) 

where we have used the subscript 2y in accordance with the usual no¬ 
tation of t tables. If the mean is actually a>0 it is desired that ( 11 ) 
occur with probability P; we want 

P[\/n 1 i/st > ^ 27 ( 71 )] — P 

which may be written 

P[<> Uy{n) - \/jr+T 0 /S 2 ] - P (12) 

where f=>Vn+I(i—o)/s 2 has the Student distribution with n degrees 
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of freedom. In (12) we shall put jD=V«+l o/si and F=«iV«s* to ob¬ 
tain 

P[t> htin) - DVF].= /S. (13) 

We diall use this rela tion t o determine a value Do for given values of 
m and «, then Do/Vn+l corresponds to a particular value of o/«i 
given in advance, and n+l wiU be the required sample size for that 
value of a/s\. In determining Do we assume t and F have a distribution 
determined by the distributions of z, Si, So. This is legitunate for only 
one use of Si; clearly for repeated use of to determine sample sizes 
one would have to evaluate D from (13) using the conditional distribu¬ 
tion of F given Si, and that conditional distribution involves the un¬ 
known variance. We thus in effect treat the problem as if the question 
of determining the same size arose in advance of the information in Si. 
(An excellent discussion of this general problem has been given by 
Stein in reference 4.) 

The variates t and F in (13) are not independent since they are both 
functions of Sg; however their joint distribution is readily found as 
follows: Since x, Si, and Sg, are mutually independent their joint density 
function is the product of their individual densities. On transforming 
the three variates to t, F, and sg, and then integrating out sg one finds 
the density of t and F to be 

f(t, F) = + mF H- (14) 

C being a function of m and n. 

To facilitate the numerical integration needed to evaluate the in¬ 
tegral implied by (13), further transformations were made. If one lets 

= (to 4 - n)P/{mF + ») 
y = mFf (mF + n) 
the density function reduces to 


g(x,y)- 


^ TO+n-H^ 


V ir(TO-l-») 

(n -A \ 


\ 2 / \ TO-h« / 

^/TO-|-n\ 

" 

V 2 / 

. y(»-g)/g(l_j^\(»-g)/S 

/to\ /to 

) 


(TO-f-»+l)/2 
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SO that X and y are independently distributed and by tabulated func¬ 
tions, X having the Student distribution and y the beta distribution, 
Nevei*theless one numerical integration is required to evaluate the 
probability in (13), and since we fixed p in advance it was necessary 
to do several such integrations for tr ial values of D to determine each 
entry in Tables I and II, D/\^n+l rather than D is tabulated. The 
entries corresponding to m=1, 2, 4, 8,16, 32 and w = 1, 2, 4, 16, 32, 64, 
128 were determined by integration; the other entries were filled in by 
interpolation. The level of accuracy of the tabular values is about one 
in 300; an entry such as 9.12 may be in error by three units in the last 
place. The entries in the right hand column of each table were taken 
from the Neyman and Tokarska (reference 3) table of the power of the 
t test. 
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APPLICATION OF THE THEORY OF EXTREME 
VALUES IN FRACTURE PROBLEMS* 


Benjamin Epstein 

Coal Research Laboratory^ Carnegie Inshtute of Technology 

In this paper it is shown that the theory of extreme values 
is pertinent to the treatment of certain aspects of the fracture 
or bieak-down of materials used in modern technology. An at¬ 
tempt is made to integrate some of the results scattered 
through the technical hterature. 

INTRODUCTION 

D uring the past twenty-five years there has been an increasing in¬ 
terest in gaining a deeper insight into the laws governing some of 
the phenomena observed in connection with the fracturing of metals, 
textiles, glass and other materials used in modem technology. Among 
other things, one would like to account for such things as the skewness 
of the distribution of breaking strengths of supposedly identical sam¬ 
ples and the dependence of strength on specimen size. 

It appears, from a survey of the literature on this subject, that work¬ 
ers in many different fields have obtained special results without realiz¬ 
ing fidly the general setting of the problem. It is our purpose in this 
paper to indicate how the application of the theory of extreme values 
leads to a resolution of some of the pertinent questions. 

SURVEY OP PIEIiD 

In essence, all the statistical models proposed in the study of frac¬ 
ture take as a starting point Grffith’s theory^ which states that the 
difference between the calculated strengths of materials and those ac¬ 
tually observed resides in the fact that there exist flaws in the body 
which weaken it. Accepting this point of view is equivalent to assuming 
that there will be a distiibution of strengths in a given specimen in the 
sense that a different amount of force will be needed to fracture a speci¬ 
men at one or another point. There are many physical situations where 
failure at one point means failure of the entire specimen. But this simply 
means statistically that it is the worst flaw among N flaws (where N 
is the number of flaws in the specimen) which determines the strength 
of a specimen and therefore the theory of extreme values is immediately 

* F^ented at the 107th AnT>iin.l Meeting of the American Statistical Association, December 30 
1047, m connection with the program on *The Theory of Extreme Values and Its Apidications.* 

1A A Griffith, “The Phenomena of Rupture and Plow m Sohds,” Philoaophieal Transaetuma cf ihe 
Royal Soetety, $giA, 163 (1020), “Phe Theory of Rupture,” Prowdtngo of the Pvrst XfOerruOumal Coftr- 
gross for Applted Moehantco, t. Delft, p 55 (1924) 
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applicable. Clearly N increases as the specimen size increases and, 
therefore, the dependence of strength on specimen size is equivalent 
statistically to the problem of how the distribution of smallest (or 
largest) values depends on N. Generally speaking, asymptotic theory 
is applicable since N, the number of flaws, is large in most practical 
problems. 

As far as is known to the author, the first worker to realize the con¬ 
nection between specimen strength and distribution of extreme values 
was F. T. Peirce* of the British Cotton Industry Association. The ap¬ 
plication of essentially the same ideas to the study of the strength of 
materials was carried out by the Swedish phsrsicist and engineer, 
WeibuU.* He, unlike Peirce who assumed a Gaussian distribution of 
strengths in the locality of flaws, made the assumption that Fq{S), 
the probability of breai^ge as a function of the stress S is given by 
— where So and m are unknown parameters depend¬ 

ing on the characteristics of the material under test. Experimental data 
drawn from the fields of textiles, metallurgy, dielectrics, and metal 
fatigue indicate that WeibuU’s assumptions are reasonable. Recently 
his views were partially corroborated by Davidenkow, Shevandin and 
Wittman.^ 

The Russian physicists, Frenkel and Kontorova,” were the next to 
study these problems. Starting with a distribution of flaw strengths 
X given by the probability density function, 

/(*) = —-i=— 

■\/2ir O' 

they found the most probable value of the strength of specimens of 
volume V large enough to contain many flaws is approximately given 
by _ 

(1) n — 0 V 2 log nF — 2 log 2Vr’ 

where n is the average number of flaws per cubic centimeter of the 
material. This result states that the strength of specimens imder the 
Frenkel-Kontorova assumptions should decrease linearly with Vlog F. 

> F. T. Peirce, "Tensile Tests for Cotton Yams V. *The Weakest lank*—^Theorems on the Strength 
of liong and of Composite Specimens,* Journal of the TextUe InatihOe, Transactions, 17, 356 (1926). 

* W. Weibull, "A Statistical Theory of the Strength of Materials,* Ing. Vetenakaps Akad. Handl., 
No. 151 (1939); "The Phenomenon of Rupture in Solids,* Ibid., No. 163 (1939). See also John Tucker, 
"Statistical Theory of the Effect of the Dimensions and Method of Loading upon the Modulus of Rup¬ 
ture of Beams,” Proceedings of the American Society of Teeting Materials, 41, 1072 (1941). 

* N. Davidenkow, E. Shevandin and F. Wlttman, "The Influence of Sue on the Brittle Strength of 
Steel,” Journal of Applied Mechanics, I4, No. 1, A63-^7 (1947). 

s T. A. Eiontorova, J. Tech. Phya. U.S.S.R., 10, 886 (1940); J. I. Frenkel and T. A. Eontorova, "A 
Statistical Theory of tl^ Brittle Strength of Real Crystals,* J. Phys. Z7.S.S.R., 7,108 (1943). 
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This should be compared ■with Weibull’s result to the effect that the 
most probable value of the strength of specimens of volume V is pro¬ 
portional to SoV~^”'. Frenkel and Kontorova consider the discovery 
of (1) to be very important and to justify the assumption that the 
local distribution of strengths due to flaws is Gaussian. Statisticians 
familiar ■with the theory of extreme values ■will recognize (1) as due in 
more precise form to R. A. Fisher and Tippett* and Gumbel.^ The more 
precise formula for (1) is 


M — 


<r\/2 log nF + 


log log nV -|- 4a- _ 
2\/2 log nV 


Beginning -with World War II several scientists in this country be- 
came interested in statistical theories of fracture. Some work was done 
by Ruark and Rosen* of the University of North Carohna under con¬ 
tract with the Navy Department. More recently two metallui^ts, 
J. C. Fisher and Hollomon,* have proposed a statistical theory of 
fracture based on the idea that the material tmder study is an elastic 
solid containing many thin disc-like cracks -with elliptical cross-sections. 
The major axes of the ellipses are assumed to be of size x possessing 
distribution p(x) a:^0. Brooks*® of the Westin^ouse Electric 

Corporation has presented experimental e-ridence indicating the im¬ 
portance of the theory of extreme values in understanding the break¬ 
down of paper capacitors. Collaboration between Brooks and the 
author** has resulted in a paper which contains results useful to design 
engineers in the field of dielectrics. It is shown, among other things, 
that the most probable value of the breakdo-wn strength V(A) of 
capacitors of area A is related to A by a relationship of the form 

(2) F(A) = a-i8logA, 


where a and /3 are positive constants determined by the number of 
flaws (technically known as conducting particles) per unit area and by 
the particle size ^tribution. 


* R. A. Risher and L. H. C. Tippett. 'Limiting Form of the Frequency Distribution of the Largest 
and Smallest Member of a Sample,” Proceedings of the Cambridge PhUoeophical Society^ S4» 180 (1928). 

7 E. J. Qumbel, 'Lea valours extrdmes des distributions statistiques,” Annales de VInstitui Henri 
Poineari, 6, 116 (1936). 

■ A. E. Ruark and N. Rosen, 'Statistical Effects in the Testing of Materials,” Naval PeeeanA 
Laboratory Report No. 0-2191. 

• J. C. Fisher and J. H. Hollomon, 'A Statistical Theory of Fracture,” American Institute of 
ing and Metallurgical Engineers, Tech. Pub. No. 2218, Metals Technology^ August 1947. 

10 H. Brooks, 'The Probable Breakdown Voltage of Paper Dielectric Capacitors,” American Inslir 
tute of Sleetrical Engineers, Technical Paper, 47-164,1947. 

u B. Epstein and H. Brooks, 'The Theory of Extreme Values and Its Implications in the Study of 
the Dielectrio Strength of Paper Capadtors,” accepted by the Journal o/Applied Physics, 
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All of the papers mentioned thus far were based essentially on the 
concept that fracture is determined by the weakest link. A much more 
complicated and very interesting problem has recently been considered 
by H. E. Daniels.^ This paper is devoted to studying the strength of 
bundles of threads where the load S is now applied to a set of elements 
in paraUel and not in series. Whereas in the series case each element 
carries the full load S, in the parallel case each thread initially carries 
load S/n. It turns out that a bundle will not break under a load S if, 
and only if, there exists an integer k^n such that k among the threads 
have strength exceeding S/k. The problem of the distribution of strength 
of bundles of n threads can no longer be treated as a problem in the 
distribution of smallest values. It turns out that whereas the distribu¬ 
tion of strengths of single long strands is negatively skewed, the distri¬ 
bution of strengths of bimdles of n threads is asymptotically normal 
for lai^ n xmder fairly general assumptions about the distribution of 
strengths of individual strands. 

Daniels’ result has been mentioned here even thou^ it is not a 
purely “weakest link” problem because of its possible importance in 
stud 3 ring certain aspects of the strength of materials. In any real prob¬ 
lem the situation is probably neither precisely described by elements 
in series or parallel, but rather by elements distributed in some rather 
complicated way. However, it is of value to know what to expect in 
certain limiting idealized models. 

THE IMPLICATIONS 07 VABIOUS DISTBIBITTION ASSUMPTIONS 
IN BBEAKDOWN PROBLEMS 

There is considerable dispute in the technical literature as to which 
assumptions concerning the distribution of flaw strengths or distribu¬ 
tions of sizes of cracks or conducting particles are the correct ones. Of 
course, this sort of question cannot be answered purely on statistical 
groimds. Adequate experimental data of the right type must be avail¬ 
able before one can decide, in a reasonably rigorous way, what the 
underl 3 dng distribution laws are. At the moment the situation seems 
to be as follows: (a) there is a certain amount of data drawn from a 
variety of fields which i>artially substantiates Weibull’s assumptions; 
(b) many experimenters have observed that the most probable value 
of the strength decreases as some function of the logarithm of the size 
of specimen; (c) the distribution of strengths of specimens all of the 
same size appears to be negatively skewed; (d) in the breakdown of 

u H. £. Daniels, "The Statistical Theoiy of the Stiwogih of Bundles of Threads. I,” Proctedings of 
ihe Botfdl Society, London, 18S, 405,1945. 
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capacitors there appears to be ample justification for assuming that the 
sizes of conducting particles are distributed according to a size distri¬ 
bution of the form z>0. It can then easily be shown that 

the most probable value of the breakdown voltage will decrease lin¬ 
early with the logarithm of the area. Ph 3 rsical data substantiate this 
and other conclusions. 

The physical implications of various distribution assumptions are 
discussed in a recent paper.** In treating such questions one is generally 
justified in working with the asymptotic portion of extreme value 
theory since the number of defects involved in fracture problems is 
large. It is a very important fact that the distribution of extreme values 
in samples of size n (n—»<») possesses a kind of stability with n playing 
in a certain sense the role of a parameter. It is shown, for example, in 
Cram4r** that if samples of size n are drawn from a population de¬ 
scribed by the continuous p.d.f./(a:) and associated d.f. F(x) = ff»f{x) 
dx, then the asymptotic frequency distribution of y, the smallest value 
in samples of size n(n-» «) is given by 

(3) h(y) = 

The d.f. is 

(4) H(y) = 1 - e-»^w. 

It is convenient to introduce a random variable v which is related to 
the random variable y by the relation^p n=nF{y), (3) 

then assumes the simple form 

(5) hill) = er^, 0 ^ ij g ». 

Similarly, the a^junptotic p.d.f. of x, the hugest value in sample of size 
n, is given by 

(6) hix) — 

The asymptotic d.f. is 

(7) Hix) - 

If a random variable $=n[l—F(a;)] is introduced, then ( 6 ) becomes 

(8) hii) = <r«, 0 ^ I g n. 

Equations (3) through ( 8 ) form a basis from which a number of 
physical conclusions of interest can be drawn. Equation (4) has a 
direct physical interpretation in the class of problems under considera- 

u B. Epstein, "Statistical Aspects of Fracture Problems,* Journal of Applied Pftvnes, 19, 140-147 
(1948). 

M H. Oiaiii4r, MoJhmaiieal MiAhodt of StaHsHea, Rinoeton UniTsisity Press, pp. 873-878,1948. 
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tion. Imagine that one is breaking one-dimensional threads possessing 
A flaws per imit length and with flaw strength distribution/(a;). Then 
AZ/ is the expected number of flaws m a specimen of length L and 
ALF(x) is the expected number of flaws having strength ^ a; in speci¬ 
mens of length L. But, as is presumably (but not necessarily) the case, 
the flaws are distributed individually and collectively at random, and 
therefore it can be assumed that the number of flaws having strength 
in specimens of length L is a random variable distributed according 
to the law of Poisson. Therefore, the expected fraction of specimens of 
length L having strength is given exactly and not merely asymp¬ 
totically by 

(9) H{x) = 1 - 

The identification of (9) with (4) is complete since AL plays the role 
of the sample size n, A similar interpretation can be attached to (7). 

If /(re) is assumed to be one of a number of common distribution 
functions, the application of (4) and (5) yields the results in Table I. 
The results for the first two distributions will be found in Cram6r.^^ 


TABLE I 


Probability Density Functions 

1. Laplace's distribution 

/(*) = — A 
2X 

2. Gauss' distribution 
V2»-' 


3. Distribution of type assumed by WeibuU 




Asymptotic Distribution of SmaHest Values 
in Samples of ^se n (n large) 


y = tJi log n/2 + X log v 
or 

1 / = — X log n/2 + Xr 

y ss fi ^ ff ^2 log n 

, _ log log n + log 4ir , c log q 

"t* ’ .... , -— 

2 V2 log n V2 log n 

or 

y «» — <r V2 log n 

^ ^ loglogn +4x , erf 

2 V2 log n V2 log n 
V - (n/w.)*'® 
or 

log y « - 1/p log na + t/P- 


n is distributed uith p.d.f. y ^0 and f is distributed 'with p.d.f. giS) 




co<r<'« 


Since f, the random portion of the smallest value in samples of 
size 71, is distributed as a distribution which is strongly skewed 

to the left, it can be said that: (a) if the distribution of flaw strengths 
foUows the Laplace law, Gaussian law, or more generally one of the 
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typeAe~^ lai^e values of |x—fi| (where A, and pare posi¬ 

tive constants), then the distribution of strengths of samples all of the 
same size is negatively skewed. More precisely, if P(y) is Ihe fraction of 
specimens of strength ^y, then log log 1/1 —P{y) is a linear function of 
p; (b) if the distribution of flaw strengths follows a Weibull distribution 
then the distribution of the logarithms of the strength of samples all 
of the same size is negatively skewed. More precisely, if P(y) is the 
fraction of specimens of strength then log log 1/1 —Piy) is a linear 
function of log y. 

There are physical data at present available which follow each of 
these patterns. For example, the distribution of breakdown voltages 
of capacitors^'" all of the same size is of type (a). Weibull* gives several 
examples, drawn from a number of fields, which are of type (b). 

In the study of size effect, i.e., how specimen size affects the distribu¬ 
tion of strengths, it is important to see what happens in Table I when 
the size of sample changes. In particular, it is of interest to see how the 
most probable value (i.e., the mode), the mean, and variance of the 
smallest value in samples of size n change with n. To do this note that 

(10) The most probable value of f = f = 0, 

The mean value of f = E(i') — f (log z)e~‘dz 

Jo 

= - .577 and 

The variance of f = F(f®) — [F(f)l* 



(log Z)*6“*£fe 


and 


It then follows that 


= Z-V6, 

(11) The most probable value of is [(fl — l)//3l"^ 
^(,i//») = r and 


(12) = /i — X log (n/2), jf = j — 0.577X, 

<7,* = X*v*/6, 
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if the original population is a Laplace distribution with parameteis 

(#*, ^)> 

/io\ - /ST- I loglogn + 4T 

(13) y = II- o-v/2 log « + 0 - - - ^ 7 ^ ■ - ' > 

2v 2 log n 

= y — 0.577«r/V^T^li, 

O'#* = <j-V/12 log n, 

if the original population is Gaussian with parameters (jt, a), and 

1 /B - i\i« 


if the original population is of the WeibuU type with parameters 
(a, jS). z denotes the mode of the original WeibuU distribution. 

Equations (12), (13) and (14), when interpreted physically, give 
definite information on how the size effect depends on the distribution 
of strengths in the vicinity of flaws. In each instance the specimens be¬ 
come weaker as V, the specimen size, increases. In the first two cases 
the relationships are both semi-logarithmic in cha racter (jf decreases 
linearly with logr V in the Laplace case and with \/log V in the Gauss 
case). However, the spread of the distribution remains unchanged in 
the Laplace case, but becomes narrower with increasing V in the Gauss 
case. Under the WeibuU assumption log y (asweU as log y) decreases 
linearly with log V and the spread also decreases. 

It can be shown similarly that if the flaw strength distribution is of 
the form for large values of | ®—jtt| (where A, B, n, and p are 

positive constants), then the most probable value of the strength wUl 
decrease proportionately to Qog n)^K If the distribution is rectangular 
then there would be no size effect. The results in this paragraph, in 
Table I, and in (12), (13), and (14) furnish a statistical basis for testing 
which assumptions on the imderlying flaw strength distributions are 
reasonable. In the last analysis definitive answers of physical interest 
must await the accumulation of more pertinent physical data. Some of 
the techniques used in analyzing such data are closely related to those 
used by Gumbd on the theory of floods” and distribution of oldest ages 
at death.” 
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It should be mentioned that in some strength problems it is conven¬ 
ient to work with the distribution of largest values in preference to the 
distribution of smallest values. This is particularly true in studying the 
breakdown strength of capacitors. It is fairly well established that one 
of the major causes of breakdown of capacitors is the presence of flaws 
known technically as conducting particles. These particles are present 
no matter how carefully capacitor paper is made or how carefully the 
capacitor itself is manufactured. The conducting particles ate spread 
at random throughout the area of the capacitor and, depending on their 
size and orientation, create a local weakening of the capacitor by re¬ 
ducing the nominal insulation thickness in the neighborhood of the 
flaws. But the voltage required to break down a capacitor is equal to 
the voltage required to break it down at its weakest point, or equiva¬ 
lently at the place where the greatest effective penetration of the in¬ 
sulation has occurred. Therefore, the distribution of largest conducting 
particle sizes is pertinent in this problem. Working with largest particle 
sizes in a sample also appears to be a good way to treat the metallurgical 
problem of Fisher and Hollomon.* If this is done it turns out that their 
statistical and physical assumptions taken together imply that the 
strength of specimens of size V should fall off as the reciprocal of 

>/i5iT. 


CONCLUSIONS AND GENERAL REMARKS 

When the problem of fracture is viewed in its most general setting it 
is clear that the “weakest link” concept, when applicable, may be the 
key to the occurrence of certain observed rdhtionships between the 
strength of specimens and their size. The term strength is meant to be 
quite general in scope: we may have in mind mechanical strength, elec¬ 
trical strength, resistance of painted specimens to the corrosive effects 
of the atmosphere, or perhaps the life span of a device which ceases to 
function when any one of a munber of vital parts breaks down. From 
a statistical point of view phenomena of this typo, though different at 
first glance, arc really equivalent since they load in each instance to 
the same problem, namely, the distribution of smallest values in lai^o 
samples. This may accoxuit for the fact that workers in many different 
fields have observed skewed strength distributions and approximately 
semi-logarithmic dependence on size. As a matter of fact, the consider- 


u E. J. Gumbel. *Tho Hotum Potiod of Flood Flows,” Annals of Maih Stof., lH, 163,1041. 

E J. Qumbel, ''la, dui6) extrdme do la vie humaine,* AjCitwA%Us SGtsnhfigttes el Industridtes, 
Hermann ot Cie, Patis, 1937. 
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ations in this paper indicate that under certain hypotheses there is a 
statistical compulsion for such results. 

Many scientists accept the fact that the Gaussian distribution plays 
a fundamental role in science, and, in fact, there are many who feel 
that this is the only distribution which nature calls truly her own. It has 
been shown in this paper, however, that in a certain class of phenomena 
the typical distributions are far from normal, and are, in fact, often 
strongly skewed to the left. There are other instances lying outside the 
scope of the present considerations where still other "characteristic” 
distributions are found. For example, the particle sizes arising from 
crushing and grinding have a marked tendency to follow a logarith- 
mico-normal form. 

In conclusion We wish to emphasize that the "weakest link” hypoth¬ 
esis cannot be expected to explain all phenomena related to strength 
of materials. In this paper it has been assumed that the flaws are dis¬ 
tributed at random and do not influence one another in any way. 
Either of these assumptions may be totally or in part untrue in certain 
physical situations. For example, putting a notch on the surface of a 
beam subjected to bending may cause the strength to drop a great deal. 
However, if many notches are put on the surface, there is scarcely any 
effect on strength. This fact is of importance in the fatigue strength 
of a body having a highly polished surface. A single scratch may be 
ruinous whereas roughing up the whole surface with many scratches 
may bring about little change in the strength. 

Weibull* and Daniels^ have obtained some interesting results relat¬ 
ing to more general situations but much remains to be done. 



PROBLEMS WITH SAMPLING PROCEDURES FOR 
RESERVE VALUATIONS* 


Ghobge C. Campbei.1. 

Metropolitan Life Jneurance Company 

The computation of tho reserve liability, running up to 
about $49 billion in North American life insurance companies, 
involves a largo amount of work, some of which might be saved 
if sampling methods were satisfactory. Certain possible ap¬ 
proaches to the sampling problem are mentioned, and the 
mathematical problem is developed for one such approach. 
Experimental results are presented from one small exploratory 
sample intended to establish orders of magnitude. Certrun ten¬ 
tative conclusions are presented. 

I T IS well known that the death rate is very low in the teens; that it is 
fairly low but increasing slowly up to the middle ages; and that it 
then increases rapidly through the very old ages. Theoretically, it is 
possible to increase the premium for life insurance each year of age in 
accordance with the risk of death in that year, starling with a very 
low premium in the beginning and reaching very high annual premiums 
late in life. Such plans have been tried and found impractical. Large and 
rapidly increasing premiums late in life are unrealistic in relation to the 
policyholder’s income. 

Very early in the insurance business it was found more practical to 
sell insurance on a level annual premium basis during the premium- 
pa 3 ring period. This means that the level aimual premiums are more 
than necessary to pay expected claims in the early years of the insur¬ 
ance. Those amounts in excess of claims must be saved and invested 
to provide a fund for the later years of insurance when the expected 
claims will exceed Uie premium collections. The theory is relatively 
simple for the computation of this fund which is called the reserve. 
Wlion a policy is issued, the present value of future net premiums, after 
deducting the loading fur expenses in the gross premium, is exactly 
equal to the present value of the benefit to be paid. In computing pres¬ 
ent value, future paymon(« are discounted to the present time on the 
basis of interest and the probability of suiwival according to a mortality 
table. Considering the case where the risk of death is increasing with 
age, if the present value of the level benefit is exactly equal to the pres¬ 
ent value of the not level premiums when a policy is issued, then at 
any time subsequent to issue, the present value of the benefit will 
exceed the present value of the future net premiiuus. The excess of 

* A papor presentod ai the 107th Annual Mooting of the American Statistical Assooiatioxii New 
York City, Decembor 30.1947. 
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the present value of the benefit over the present value of future net 
premiums at any point in time is the reserve on the policy which the 
Company must sot up in order to meet its commitments.^ The reserve 
is a definite liability of the Company which must bo shown on the 
balance sheet as required by law. 

After the mortality table and the rate of interest have been selected 
to satisfy legal and managerial requirements, the average net level 
premium reserve per $1 of insurance is uniquely determined for each 
type of policy by the contractual provisions of the policy (including 
statutory references), by the duration from issue, and by the age at 
issue. In the Metropolitan, such a classification leads to about 200,000 
individual cells for Ordinary and Industrial Life Insurance. It is nec¬ 
essary to ascertain the amount of insurance in force (i.c., in being) fall¬ 
ing into each of these cells. A large amoimt of work is expended in the 
classification of all current transactions, such as new policies issued, 
lapses, revivals, death claims, maturities, etc., into these 200,000 cells 
in order to carry forward the classification from one year to the 
next. Even after the insurance in force is obtained in this detail, it is 
necessary then to multiply the amount of insurance in each one of 
these cells by the appropriate reserve factor per dollar of insurance 
to obtain the corresponding reserve liabilities which in turn must be 
summarized into final figures. All of this work must be thoroughly 
checked in detail and by controls to insure accuracy. 

The determination of the reserve liability for aU of the policies the 
Company has outstanding is generally referred to as reserve valuation. 
Separate calculations are made for the several branches of business, 
and for related benefits. The amounts involved are voiy large and con¬ 
stitute the principal liability of any life insurance company. For the 
business at large, in North America at the end of 1947, the total assets 
amount to about $65 billion of which about $49 bUlion is required to 
cover reserve liabilities. In any one of the larger companies, the reserve 
may well run to a few billion dollars. It is clear that the reserve valua¬ 
tion problem is of great practical importance and that it involves con¬ 
siderable labor and expense. It should not be assumed, however, that 
most of this work could be eliminated simply by eliminating or modify¬ 
ing the requirements for reserve valuation. Much of the same informa¬ 
tion would still be needed for mortality studies and for other purposes. 
If it were possible to obtain the reserve liability from a reasonably small 
sample with sufidcient accuracy to protect adequately the policyholders 
and to be accepted by Company management and by State supervisory 

^ Fbr details regardins net pFeminin and reserve computations, see E. F. Spureeon, Life OonUnffm- 
eiee, Cambiidge TJxdvexsity Fress, 1932. 
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authorities, a considerable saving in Home Office expenses could be 
made. 

TUE MATnEUATIGAL PBOBLEH OF SAMFLINO 

When we approach the valuation problem from the sampling point 
of view, a number of possibilities present themselves. The maximum 
saving in clerical labor could be made if a satisfactory approximation 
could be obtained by sampling from a file of caids representing policies 
in force without the need for recourse to any running transaction 
accounts whatsoever. This would mean, for a particular block of busi¬ 
ness, that we would require reliable estimates from a sample alone 
for a number of policies, the amount of insurance, and the reserve, 
together with their sampling variances. 

It seems unlikely, however, that Company nmnagement would be 
willing to discard all control accounts on the number of policies 
and on the amount of insurance. Consequently, it seems more realistic 
to approach the sampling xn'oblem on the assumption that certain con¬ 
trol accounts would be kept, giving the number of policies and the 
amount of insurance in force accural cly for certain blocks of business 
in convenient sub-groups. The problem reduces to finding the average 
reserve per $1 insurance, and its variance, from a sample separated into 
corresponding sub-groxips. The reserve would then be estimated for 
each sub-group from the actual total amount of insurance and the 
average reserve per $1 insuraivce as estimated from the sample. 

It appears desirable to take advantage of the high correlation exist¬ 
ing between the amoimt of reserve and the amount of insurance 
on groups of individual policies. There is an exact mathematical rela- 
tionsliip for individual policies, and a correlation coefficient of the order 
of 0.7 for the aggregate group of Regular Ordinary policies in the 
Metropolitan. In most of the sampling problems related to valuation 
studied by the writer, the solution depends on the relation of the quan¬ 
tity being estimated (in this case, the reserve) either to the amount 
of insurance or to the amoxuit of gross animal premium. This rela- 
tionsliip leads to a two-way scatter diagram. 

Consider the two-way scatter diagram presented as Table I with the 
reserve factors per |1 of insurance along the X-axis and the amount of 
insurance along the F-axis. In the mathematical derivation wo must 
consider the related scatter diagram formed by the amount of reserve 
along the X-axis and the amount of insurance along the 7-axis. For¬ 
tunately, numerical results for the second scatter diagram can be ob¬ 
tained from the fiurst scatter diagram based on reserve factors by 
taking the appropriate cross product between the reserve factor and 
the amount of insurance whenever the amount of reserve is needed. 



















































TABUS 1—Continued 

DISTRIBUTION OP A SAMPLE OP 1.015 REGULAR ORDINARY POLICIES ACCORDING TO 
AMOUNT OP INSURANCE AND RESERVE PER DOLLAR OP INSURANCE 


SAMPLING FOB BBSBBVB VALUATIONS 



Totals 1 588 72 110 
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The average reserve per $1 insurance in a sub-group of any particu¬ 
lar block of business is obtained readily by summing the reserve on aU 
of the policies in the sub-^roup and dividing by the sum of the insur¬ 
ance on all of its policies. An estimate is required for the variance of 
tho avenge reserve per $1 of insurance. 

Let T=the average reserve per $1 insurance 
^=the average reserve per policy 
5=the average insurance per policy 

Then 


and 


Y = RjS for the universe 


V + AV 


len-Aje 

-=-=r for a sample 

S + AS 


Hence 


and 
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Summing for all samples, dividing by the number of samples, and 
taking unity as a reasonable approximation to the last factor 


i?*r v| (TjrtTir «r» 

5* Li?® RS 5®J 


(5) 


Replacing tho variances of averages by tho variances of individual pol¬ 
icies, we have the approximate formula 




( 6 ) 


where n is the number of policies in the sample and r is the correlation 
coefficient between B and 8. 

The above expression enables us to compute an approximate value 
of the required variance of the average reserve per $1 of insurance from 
the scatter diagram. 

EXPERIMENTAL WORE 

Some exploratory work was done with a sample of 1,015 Regular 
Ordinary policies drawn from a total in force of about 4.3 million poli- 
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cies for $9.6 billion of insurance with $2.0 billion of reserve liability. 
It was thought that a small exploratory sample would serve as a pilot 
study to establish certain orders of magnitude which might be helpful 
in planning any subsequent investigation of the samplii^ problem. 
There is no intention to use such a small sample for actual valuation 
purposes. 

A sample from this block of business was taken out near the end of 
1946 for another purpose by counting through the renewal cards and 
picking every 50th card. The sample renewal cards were microfilmed 
in order to return them to the active file immediately. The exploratory 
valuation sample of 1,015 cards was obtained from the microfilm file 
by running it through the viewing machine and stopping the film at 
e^ cry second turn of the crank. The facts from the projection in view 
were copied by hand on a separate card for this study. 

The total sample of 1,015 cards was used to determine an average 
reserve per $1 of insurance by valuing the individual cards seriatim and 
adding both the insurance and the reserve on the cards to form two 
totals. The quotient gave 0.21515 for the average reserve per $1 of 
insurance. Applying this average to the actual total insurance amount¬ 
ing to $9,587 million, an estimated reserve of $2,063 million was ob¬ 
tained which is $53 million in excess of the true reserve. This gives 0.69 
as the ratio of the error to the standard deviation of $77 million esti¬ 
mated from the sample. Hence by tius method of sampling, results to 
this degree of accuracy or better would be obtained in about half of 
repeated applications. 

Individual policies, rather than grouped frequencies from the scatter 
diagram, were used for the numerical work l^ause of Hie dedre to 
experiment with various subdivisions. In order to see how the averages 
would behave in repeated samples, the sample of 1,015 cards was fur¬ 
ther subdivided into ton groups, oach group consisting of all the cards 
with policy niunbors ending in the same digit. Table II, giving the 
results from this experiment, shows the average reserves per dollar of 
insurance, the total reserve for this block of business estimated from 
each of the ten sample averages, and the resulting errors obtained by 
comparison with the actual reserve. 

Theactual reserve from the regular arithmetical calculation amounted 
to $2,010 million. The best estimate from the small sample of 111 
cards was $34 million under the true value, while the worst estimate 
obtained from a sample of 93 cards was ^18 million over the true 
value. It would be expected that 5 of the ratios of error to the standard 
deviation would be under 0.6745, disregarding signs, and that 5 would 
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TABLE II 

BEHAVIOR OF AVERAGES FROM REPEATED SAMPLES 


Final 

Digit 

Number 
Policies 
in Sample 

Average 
Reserve 
per $1 
Insurance 

Reserve Esti¬ 
mated from 
Sample Average 
and Actual 
Insurance* 

Error 
(Estimate 
less Actual) 

Ratio of 

Error to 
Standard 
Deviationt 

0 

105 

.2279 

(Millions) 

$2,185 

(Millions) 

175 

0.73 

1 

93 

.2532 

2,428 

418 

1.65 

2 

109 

.1952 

1,871 

-139 

-0.59 

3 

111 

.2062 

1,976 

- 34 

-0.15 

4 

108 

.2517 

2,413 

403 

1.71 

5 

81 

.1959 

1,878 

-132 

-0.49 

6 

118 

.2158 

2,068 

58 

0.26 

7 

110 

.1898 

1,820 

-190 

-0.82 

8 

87 

.2372 

2,274 

264 

1.01 

9 

93 

.1712 

1,642 

-368 

-1.45 

Trim 

Values 

.20966 

$2,010 




* Actual Insurance $9,586.83 million. 

t Standard Deviation estimated from total sample of 1,015 policies adjusted for sise of individual 
samples. 


exceed this value; actually there were 4 under and 6 over. Considering 
20 per cent frequency intervals from the lower limit to the upper limit 
of the distribution, there were 1,3,1, 2, 3 ratios in each interval com¬ 
pared with the expected 2. 

The subdivision of the sample of 1,015 cards into ten random sub¬ 
groups illustrates that even with a sample of the order of 100 cards, 
it is possible to estimate the reserve, and to predict the range of errors 
with reasonable certainty because the errors, measured in terms of 
their expected standard deviation, appear to be distributed in reason¬ 
ably close agreement with tho normal frequency curve. 

EXPERIMENTAL STRATIPICATION 

In the above computation, the policies in the sample were grouped 
together. It would be natural to inquire if the result might bo improved 
by stratifying the samplie. 

It is easy to select a valuation sample stratified by years of issue 
since policy cards usually are filed in numerical order. This breaks 
them approximatdy by years of issue. Any other stratification, such 
as by plans or groups of plans, would be difficult to handle if the cards 
had to be selected initially on that basis. With some sacrifice of poten¬ 
tial efficiency, however, the sample might be stratified after selection 
for application against sudi control totals. 

An esperiment was tried with stratification by year of issue with all 
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plans combined in each year of issue. Average reserve factors per $1 
of insurance from the sample were applied to tho total business by 
years of issue. This method of estimating the reserve by a stratified 
sample reduced the standard deviation from $77 million to $47 miUion^ 
and reduced the error from $53 million to $19 million, giving a ratio of 
error to standard deviation of 0.41. This was the best result produced 
from this sample. The standard deviation of this estimate indicates 
that the error would be within about $32 million half of the time from 
repeated samples of 1,016 policies selected in this way. There would be 
a 5 per cent risk of exceeding an error of $92 million. The results from 
this experiment are shown in some detail in Table III. 

It is interesting to note that, in the last column showing the ratios 
of the errors to the corresponding standard deviations, there are 12 

TABLE III 

EXPERIMENTAL SAMPLE STRATIFIED BY YEAR OF ISSUE 


Year of 
lf»ue 

Num¬ 
ber of 
PolieicB 
In 

SampLo 

Average 
Eoaerve 
per SI 
Insuronoe 
From 
Sample 

Aetual 
Total 
Insuranee 
In Force 
(MUUons) 

Reserve 

(MUlions) 

Estlinated* 

Standard 

Deviation 

(MilUons) 

Ratio 
Error to 
Standard 
Deviation 

Estimated 

From 

Sample 

Aotual 

Error 

1945 

71 

.04148 

S775.44 

32.15 

20.30 

HIHfljS 

4.07 

.70 

1944 

80 


883.29 

37.93 

37.11 

■ia 

4.08 

.20 

1943 

37 


640.93 

34.18 

37.21 

- 3.03 

8.68 

- .48 

1942 

65 

.08048 

438.58 

35.30 

38.58 

- 1.28 

4.33 

- .29 

1041 

82 

.11578 

695.02 

88.89 

81.08 

7.81 

10.37 

.76 

1940 

59 


440.87 

58.98 

54.19 

2.77 

4.88 

.67 

1939 

59 

.13598 

425.47 

57.86 

59.33 

- 1.48 

7.43 

- .20 

1938 

48 

.13875 

408.94 

58.74 

67.40 

-10.88 

11.23 

- .96 

1937 

59 

.17848 

478.37 

85.02 

87.15 

- 2.13 

8.97 

- .24 

1938 

42 

.18573 

439.48 

72.83 

90.42 

-17.69 

6.98 

-2.94 

1936 

45 

.22882 

418.48 

96.07 

05.80 

.07 

7.23 

.01 

1934 

50 

.20800 

403.89 

90.45 

112.15 

-16.70 

, 13.53 

-1.16 

1933 

58 

mwm 

374.42 

100.38 

98.10 

4.28 

7.70 

.58 

1932 

44 


343.81 

101.00 

08.40 

3.20 

11.21 

.20 

1931 

34 

.27207 

391.41 

100.40 

118.18 

-n.69 

0.14 

-1.28 

1930 

25 

.38300 

353.63 

135.42 

118.37 

10.05 

10.65 

.97 

1929 

32 

.37950 

383.52 

137.99 

120.85 

8.34 

11.67 

.72 


29 


315.03 

134.70 

121.80 

13.07 

11.63 

1.12 


33 

KSS 

289.53 

120.05 

112.53 

7.62 

10.67 

.71 


18 

,33777 

185.98 

62.81 

84.62 

- 1.81 

3.82 

- .47 

1926 

9 

.37549 

177.88 

60.79 

66.47 

1.82 

4.10 

.32 

1924 

17 

.42289 

151.01 

63,83 

67.86 

6.97 

4.49 

1.33 

1923 

7 

.33840 

140.78 

47.63 

68.86 

- 9.03 

3.89 

-2.32 

1922 

8 

.68092 

87.57 

i 49.12 

38.38 

12.74 

3.25 

8,92* 

1921A prior 

38 

.49779 

348.04 

172.25 

188.43 

3.82 

18.80 

.20 

Total Sample 

1,016 


to,588.81 

$2,029.09 

$2,009.84 

19.25 

47.17 

.41 


* Note that the estimate of the standard de^alaon becomes very unr<diable whon the number of 
poUoies is small; for example there are only 6 polides in the sample for the issue of 1922. 
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below and 13 above the absolute value 0.6745, compared with the 
expected 12.5 in each group. If the 25 ratios are classified into 20 per 
cent frequency intervals from the lower to the upper limit of the dis¬ 
tribution, the successive intervals contaiu 5,3,5,8,4 ratios as compared 
with the 5 expected iu each interval. This agreement seems satisfactory. 

It will be noted that the stratification experiment was based entirely 
on the sample drawn before considering stratification. Consequently 
this is not an indication of the best result that might be attained from 
eflcient stratification. If the sample had been drawn originally with a 
view to stratification, it would have been selected in such a way as to 
minimize the standard deviation of the final estimate. This would have 
required drawing more cards in some of the years of issue where the 
reserve is heavy and a smaller number of cards in other years of issue 
where the reseirve is very li^t. For example, there were 71 cards in the 
issue of 1945 where the total reserve is only $29 million but only 32 
cards in the issue of 1929, which has a reserve of $130 million. Conse¬ 
quently the issue of 1945 contributed only 17 million million to the 
total variance, while the issue of 1929 contributed 134 million million to 
the total variance. 

For a given total number of cards in the over-all sample, the selection 
by years of issue should be made on a basis to minimize the variance 
of the resulting estimate. The variance was recomputed approximately 
on the basis of the most efiBicient distribution of the sample by years of 
issue, assuming that the estimated variances and correlation coeffi¬ 
cients among individual policies would not change within the years of 
issue. A new value of 1,692 million million was obtained for comparison 
with the 2,225 million million in the actual sample. These figures indi¬ 
cate that for this distribution a given degree of accuracy could be 
attained with about one-quarter fewer cards in the sample if it wore 
stratified properly by years of issue to minimize the variance. The cor¬ 
responding standard deviations would be $41.1 million compared with 
$47iS million. 

On the basis of stratification by year of issue, but without reflecting 
the most efficient distribution of the sample. Table IV has been pre¬ 
pared for Metropolitan Regular Ordinary business, giving the approxi¬ 
mate number of policies that would be required in a sample in order to 
provide various degrees of accuracy and risk. The amount of error is 
shown down the left margin both as a percentage and as an approxi¬ 
mate amoimt of money based on the $2.0 billion reserve in this block 
of business. The amount of risk is shown across the top of the table. 
For example, if a sample of 80,000 i>olicies were drawn time after time. 
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TABLE IT 

ABFBOXIMATE SIZE OF SAMPLE FOB TABIOUS DEQBEES 
OF ACCUBACY AND BISK* 


Error in Reserve 

Risk of Exceeding Designated Error 

Amount 


60% 

■un 

6% 

1% 

0 1% 

(Millions) 

% 

Approximate Number of Policies Required in Sample* 

$ 2 

0.1 

230,000 

1,100,000 



1,950,000 


2,500,000 

5 

0 25 


220,000 



600,000 


760,000 

10 

0 5 


67,000 



140,000 


220,000 

20 

1 


14,500 

21,000 


35,000 


57,000 

40 

2 


3,600 

5,100 


8,800 


14,500 

80 

4 

1 150 

900 

1,300 


2,200 




* Adjusted for eamplioR from fimte umTeteoof 4 27 million BegulsrOrdmoiy pohcaea in the Metro¬ 
politan Idfe Insuranre Company* 


and if the reserve were estimated from each such sample, the error 
would not exceed 0.5 per cent or $10 million in more than 5 per cent of 
such estimates. It might be said also that the error would not exceed 
0.25 per cent or $5 million in more than one-third of such estimates. 

CUBBBNT APPUCATIONS 

The exploratory work reported above gives an indication of the pos¬ 
sibilities of sampling procedures in reserve valuations. The question 
naturally arises as to what extent these techniques are actually in use 

As a matter of fact, sampling has been used very little, if at all, for 
any major reserve valuation but it has been used in a number of in¬ 
stances for smaller items which would be very troublesome to obtain 
from a complete inventory. One large item of this nature is the deferred 
premium asset for Metropolitan llegular Ordinary business. This asset 
arises from the fact that a very considorable proportion of premiums 
are paid semi-annually, quarterly, or monthly. Since the reserve lia¬ 
bility is computed for simplicity on the assumption that all premiums 
are paid annually, it is necessary to calculate an adjusting item for 
the balance sheet. 

Tlio item of deferred premiums depends entirely on the distribution 
of premiums according to tlie month of issue date and according to the 
mode of payment (annual, somi-aimual, quarterly or monthly). Ob¬ 
viously this distribution does not change very rapidly with time, and 
the results from a single sample can be carried forward for a few years. 
The problem is to obtain from a sample the estimated average deferred 
premium at the end of the calendar year per dollar of gross aimual 
premium in force, together with the variance of the estimate. A 2 per 
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cent random sample amoimting to 91,011 cards was drawn for the most 
recent estimate in 1946. For this sample, the average deferred premium 
per dollar of gross annual premium in force was $0.1964, with the stand¬ 
ard deviation indicating an error range of about 1 per cent of the 
average at the two sigma level. Thus, in an item of $50 million, there is 
a risk that about once in 20 years, on the average, the error in the esti¬ 
mate will be more than $0.5 million. Although the valuation of this one 
item involves a veiy substantial amount, it is veiy small in relation to 
the reserves for the total business of the Company. 

Another end-of-year adjustment item, “Due and Unpaid Premiums,” 
arises from premiums not paid on the due date because policies are not 
transferred to the non-forfeiture accounts imtil sometime after expira¬ 
tion of the “grace” period. This item is much less stable than the de¬ 
ferred item because it depends considerably on current economic psy¬ 
chology in addition to the proportion of gross annual premium due in 
November and December. The average premium due and unpaid at 
the end of 1946 amounted to $0.0496 per dollar of gross annual premium 
in force, giving a total amount of about $12 million. Because this item 
may change rapidly with time, and is likely to be influenced by Christ¬ 
mas buying, a new estimate is required each year at the end of Decem¬ 
ber. The amount of premiums due and impaid at the end of 1947 was 
estimated from a sample of 100 district averages selected from the 
finite universe of 787 district averages in the total Company. Sampling 
among these averages rather than among individual policies is much 
less efScient statistically, but it has been adopted because of certain 
filing advantages. For the same accuracy, about ten times as many 
cards are required to obtain the sample district averages as would be 
used in a random sample of individual policies. 

The Metropolitan uses a sampling method for the reserve of nearly 
$13 million on Additional Insurance purifiiascd by the application of 
dividends. A sample is taken of all the policies with issue date anniver¬ 
saries in two different months of the year, giving about 35,000 policies 
with Additional Insurance. For each of nine subdivisions, the amount 
of Additional Insurance and the amoimt of reserve on this insurance is 
obtained from the sample. Then the average reserve per doUar of in¬ 
surance is obtained. The total reserve in the nine sub^visions is esti¬ 
mated by multiplying corresponding averages by the total amount of 
Additional Insurance for each subdivision. The result from one sample 
is carried forward from one year to the next by an accumulation proc¬ 
ess which reflects the new dividends applied as single premiums to 
purchase Additional Insurance, the interest required on the reserve, and 
adjustments for terminations. The sampling process is repeated every 
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three to five years as a check and correction on the accumulation proc¬ 
ess. 

GENBBAli CONSIDEBAXIONB 

A question will arise as to why sampling methods have not been used 
more widely for valuation purposes, and to what extent such methods 
may be used in the future. 

The answer hinges on the degree of accuracy demanded in the result. 
With respect to reserve valuation the experimental results presented 
indicate that a sample of very reasonable size, 21,000 policies for ex¬ 
ample, would give substantial assurance that the estimated result 
will be accurate to 1 per cent, but a 1 per cent error in the reserve of 
$2 billion on the Kcgular Ordinary business becomes a $20 million 
error. Neither Company management nor State Insurance Depart¬ 
ments would permit errors of this size. Although an error of $20 mil¬ 
lion is clearly larger than could be tolerated in practice, the amoimt 
of error might be reduced within certain limits by increasing the rize 
of the sample. In any sampling plan it is necessary to determine the 
tolerance limits before proceeding to determine the size of the sample. 

The reserve is based on very broad assumptions even if it is com¬ 
puted arithmetically with complete correctness, the assumptions being 
in regard to the probabilities of death and in regard to the rate of 
interest. A very moderate change in one of these assumptions may make 
a very substantial absolute change in the total reserve even thou^ the 
percentage change may be relatively small. For example, if it is decided 
to assume that the Company will earn an interest rate of 0.25 per cent 
less in tho future than previously had been assumed, this change in 
assumption may well add some $60 million to a reserve of $2 billion. 
Considering the approximate nature of the basic assumptions under¬ 
lying the ro.serve calculation, a statistician might well inquire whether 
the degree of arithmetical accuracy now demanded is rcaUy necessary. 
There arc other important practical considerations, however, which 
must be taken into accoimt. Let us inquire into the source of the re¬ 
quirement and into the extent to which such accuracy is really needed. 

Section 205 of the current New York Insurance Law provides in part, 
"The Superintendent shall annually value, or cause to be valued, the 
reserve liabilities (hereinafter called reserve) for all outstanding life 
insurance policies and annuity and pure endowment contracts of every 
life insurance company doing business in this state,... In calculating 
such reserves, he may use group methods and approximate averages for 
fractions of a year or otherwise.” This last smitence may be interpreted 
to permit the use of sampling techniques. In practice, however, the 
New York Department of Insurance is reluctant to approve approxi- 
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mate methods of valuation unless it can be demonstrated that the eiror 
is BTnftll. Moreover, there is a tendency to consider the error as an ab¬ 
solute amoxmt rather than as a percentage. 

After the CSompany makes the valuation required by law, it for¬ 
wards its working papers to the State Insurance Department for further 
checking and review. The formal transmittal to the New York Insur¬ 
ance Department is accompanied by an affidavit signed by the Presi¬ 
dent, the Vice-President and Chief Actuary, and the Assistant Actu¬ 
aries in immediate charge of the work, and reads in part: 

"That the valuation schedules, sununaries and other valuation 
data ... have been submitted to the State of New York Insurance 
Department as the sole basis for calculating said Company’s valua¬ 
tion results and issuing its Certificate of Valuation; 

“That such schedules have been properly and correctly prepared 
from the policy and annuity valuation records and that such 
records accurately and completely cover all of the policies and 
annuities of the said Company outstanding and paid-for on the 31st 
day of December, 19 ... ; 

“That such schedules contain, (1) complete and correct group¬ 
ings according to age, issue, duration, kind and amount of all such 
policies and annuities, with all the data necessary for making a 
mean, grouped valuation of the same upon the net premium basis 
as required by law, and (2) full and complete information necessary 
for calculating all special net reserve liability items, including 
reserves for disability and accidental death benefits; 

“That the data for the said mean, grouped valuation produces 
results equal to those required by the terms of the policies and 
annuities and the law applicable thereto: 

“That the above named Actuary has charge of the Company’s 
policy and annuity valuation records and of the valuation sched¬ 
ules and summaries, and has supervised the preparation of the 
schedules and of the Analysis of Valuation hereto attached and 
bearing his signature, and that the checks placed on the same 
insure their accuracy; and 

“That the total amount of reserves on said policies and annui¬ 
ties is correct as reported on such schedules, summaries and other 
valuation data and as set out in the said Analysis of Valuation, 
according to their best information, knowledge and bdief.” 

It is veiy difficult to base a complete valuation on sampling methods 
and still submit sudb a specific affidavit even if the sample were in¬ 
creased to an impractical size sufficient to eliminate almost all uncer¬ 
tainty. 
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SAUPUNO FOB BESEBVeVaX.TTATIONS 

Aside from the legal requirements, Company management could not 
tolerate an error of estimation which would become s^nificant when 
carried into the surplus, and which would become more significant in 
the surplus earnings of a single year. An error of $1 million in the re¬ 
serve, in opposite directions at the beginning and at the end of the 
year, would result in an error of $2 million in the surplus earnings of the 
single year. The amount of annual surplus earnings, the starting point 
for the distribution of dividends to policyholders, is about $80 milli on 
for the block of business on which the experimental results of this dis¬ 
cussion were based. It is clear that the amount of acceptable error due 
to sampling fluctutions is limited severely by the necessity for accuracy, 
consistency, and continuity in this important item. 

FUTTJBB APPUCAnONS 

It appears likely that the immediate extension of sampliug techniques 
in reserve valuations will be limited to the reserves on related benefits, 
including certain paid-up policies, on which the total reserve is so small 
that an error of the order of 1 per cent does not look too large in rela¬ 
tion to the surplus earnings in a single year. 

One can only speculate in regard to the future development of sam¬ 
pling techniques for the more important valuation of the main life in¬ 
surance reserves, nmning into billions. Application to independent 
checking and auditing mi^t well be a possible development of sampling 
techniques. State Insurance Departments frequently consider the prob¬ 
lem of reconciling the active file of renewal cards with the separate book 
record in force classification accumulated over the years from the trans¬ 
actions and used as the basis for the arithmetical calculation of the 
reserve liability. The cost of a complete inventory would be excessive, 
not to mention the difficulty of “freezing” the very active file of renewal 
cards long enough to carry out the work. The exploratory methods used 
in this discussion, with some extension and with a reasonably large 
sample, could provide an independent (ffieck on the total reserve which 
would detect any error of sufficient size to be significant when carried 
into surplus. 

Progress may be expected in developing sampling techniques for the 
main life insurance reserve, perhaps by the indirect approach of sam¬ 
pling transactions rather than by direct frontal attack. It probably will 
take years of study, however, before sampling methods will become 
practicable and generally acceptable for the main life insurance reserve 
liability in the Annual Statement. 



RECENT DEVELOPMENTS IN GRADUATION AND 
INTERPOLATION* 


Thomas N. E. Gbhvillb 

National Office of Vital Statistics, U. S, Public Health Service 

Some developments during the 10-year period 1938-1947 in 
the techniques of graduation, or the smoothing of data, and of 
interpolation, are reviewed and summarized. In preparing this 
summary of developments in graduation and interpolation 
during the past decade, it has been the author’s intention to 
refer to the more important results which have come to his 
attention, and which he considers are likely to be of interest 
to practical statisticians and actuaries who have occasion to 
perform graduations or interpolations of actual data. No at¬ 
tempt has been made to deal with the more abstract topic of 
interpolatory function theory. Within the limits thus set, 
there has been no intention to slight anyone, and the author 
will appreciate having his attention called to any omissions. 
For derivations, proofs, and more detailed treatment, the ref¬ 
erences given in the bibliography may be consulted. 

I. GBABUATION 

C Mracterisiic function of a graduation formula. Graduation (or 
^smoothing”) has been defined as "the process of securing from an 
irregular series of observed values of a ... variable a smooth regular 
series of values consistent in a general way with the observed series of 
values” which "is then taken as a representation of the underlying law 
which gave rise to the series of observed values” [17].* A traditional 
method of graduation is based on the use of weighted moving averages.* 
This method may be represented by a formula of the form: 

( 1 ) 

wbeie f/w denotes the sequence of given data to be smoothed, F» is the 
"smoothed” value of y» and the numbers L, are the numerical coef¬ 
ficients which characterize the formula. In practice, of course, both the 
given data and the coefficients L, are limited to a finite range. 

In an eluant study of this method of graduation, Schoenberg [18] 

* A paper presented at the 107ih Annual Meeting of the American Statistical Association, New 
York City, December 30,1947. 

1 Numbers in brackets refer to the bibliography at the end of this artidle. 

> This method is also referred to as graduation by 'Rested averages” or by ^linear compound¬ 
ing.” Prominently associated with its early development are the names of De Forest [28] and Sheppard 
[ 10 , 20 ]. 


428 




GBADTTATION AND INTERPOLATION 


429 


has defined a "characteristic function” for each such smoothing foi> 
mula, and has shown that the properties of the formula are uniquely 
related to certain properties of this function, whidbi thus uniquely 
“characterizes” the smoothing formula; so that, knowing the function, 
one can deduce the formula, and vice versa. Specifically the charac¬ 
teristic function of the given sequence is defined as 

00 

T(u) = 52 yne*"“, 

n «»*—00 

where u is an arbitrary parameter. Similarly the characteristic function 
of the smoothing formula is defined as 

00 

4>{u) = 52 

By multiplication, it is evident that 

00 

!r(«)^(tt) =a 52 Pne”^*, 

flBB—*00 

so that the characteristic function of the “smoothed” sequence is the 
product of the characteristic functions of the given data and of the 
smoothing formula. If the coefiSicients L, are symmetrical in the sense 
that L-,=L„ the characteristic function of the formula can be ex¬ 
pressed in the form of the Fourier series: 

^(tt) = Lo + 2 L» cos nu. 

In terms of the characteristic function, Schoenberg proposes a cri¬ 
terion for stating when a formula of this type is really a “smoothing” 
formula in the sense that its application may be expected actually to 
increase the smoothness of the data. It has been customary to judge 
the smoothness of a graduation by the size of some specified order of 
finite differences of the graduated values. Using the same approach, he 
establishes the relations 

52 r (2 sin | T(,u) l*dtt, 

nB»«>oo V 0 

i: (A“F,)* = 1 f *' (2 sin §«)*« 1 r(«) 1* I m l*d«, 

where m is any nonnegative integer. Observing that the integrands of 
these two expressions differ only by the factor |^(u)|S he defines a 
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smootMag formula as one for which. | ^(u) | ^ 1, for 0 ^ 2t, with the 

additional stipulations* 

2 = 1 and X I L„| < ». 

nva—00 n=»—oo 

Application of orthogonal polynomials to gradtiation formvlas of maxi¬ 
mum smoothness. A smoothing formula is said to be exact for the degree 
m if it reproduces exactly the values of any pol 3 momial of degree not 
exceeding m. In this connection, Schoenberg shows that a formula of 
the type (1) is exact for the degree m if and only if ^(«) — 1 has at«=0 
a zero of order m+1, or, in other words, if this expression and its first 
m derivatives all vanish for u=0. 

If for n>%, where If], and 1,-4, are not zero, the formula is 

said to he of span 2fc+l. If each of 24+1 consecutive ungraduated 
values is assumed subject to a random error with zero mean and vari¬ 
ance the same for all the 2A;-t-l values, then the ratio of the mean 
squared error of the graduated value to that of the ungraduated value 
is 

Lifl 

t Uw -t 

Therefore particular interest attaches to those formulas which minimize 
this ratio for a given span and degree reproduced. These are said to 
effect the maximum "reduction of error,” and are sometimes referred 
to as formulas of “maximum weight.”* Those formulas for which 

i: (A»L„)* 

is a minimum have frequently been called formulas of maadmum 
smoothness. However, it must be admitted that the choice of third 
differences for this purpose is somewhat arbitrary, and that an argu¬ 
ment for the use of some other order of differences could be advanced 
with equal justification. It has long been known [7, 1, 16] that the 
computation involved in performii^ a graduation by a formula of max¬ 
imum error reduction can be considerably simplified by the use of a 
set of orthogonal polynomials associated with the name of Gheby^ev. 


> Seihoenberg limits the definition to fonnnias with symmetric coefficieniB» but it appeals to be 
equally applicable to those with unsyxnmetrie coefficients. 

< This is, of couise, a different use of the word ^weight* from that emidoyed when a ^Sveifi^ted 
average* is referred to. For a fuller discussion of the prindides underlying these formulas and the for* 
m n las of m a rimum smoothness next mentioned, see [17, Chapter 4]. 
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Tbe Chebsr^ev polynomials n) are ^ven by tbe formula: 

Ti(x, n) = A*[*(o(a: - «)«,], 

where »«) denotes the expression *(*—1) • • • (*—t+l)/t! These poly¬ 
nomials are orthogonal over the summation range 0 to n—1: in other 
words, we have: 

n-~l 

2 T^(x, n)Tj{x, n) => 0 


if »Vi, while 

Z [nx, n)l* = Si{n) 

SBsbQ 


n(n* - 1)(»* - 4) • • • (n* - <*) 
(2t -I- l)(i!)* 


If we now consider the smoothing formula for obtaining a graduated 
value of which involves the given values y, to inclusive, 

is exact for the degree r, and gives maximum reduction of error, it is 
well known that the coefficient of y<H-s is given by 



Ti{x, n)Ti{z, n) 
8iin) 


( 2 ) 


This remarkably simple expression for the coefficient results from the 
orthogonal property of the polynomials Ti{x, n). As fairly extensive 
tables of the values of Ti(x, n) and S,(n) have been published [3], this 
formula makes it easy to compute linear compound coefficients, not 
only for the symmetrical ‘‘mid-panel” formulas used over the greater 
part of the range of given data, but also for imsymmetrical “end- 
panel” formulas to be used near the ends of the range. Greville [13] has 
recently shown that the coefficients for the formulas of maximum 
smoothness also con be expressed in terms of the Cheby^ev poly¬ 
nomials. Using the same notation as in formula (2), with the addition 
that m denotes the order of differences taken as a criterion of smooth¬ 
ness, the coefficient of 


Lm—m — 


(—l)’"(g+m) (2—n) 



A"^Tiix, n+m)A”'Ti(z, n-bw) 
(m+i) 


where denotes s(a—1) • • • (z—m+l). 

Differ 0 nce eqmtim graduation. For a number of years, a favorite 


> This formula represents a slight improTement over the one given in the aforementioned artide. 
For derivation of this formula, see the author's reply to the discussion of [13]. Seeord, dmeiioaa Insdtute 
of AotuaiieB, Vol. 37, Fort 1, No. 76, April 1948. 
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method of graduation among American actuaries has been the "differ¬ 
ence equation” method developed by Whittaker [27] and Henderson 
[14,15]. This method results from minimizing the expresfflon: 

2 («J« “ y*)* + 

StaBia s^Q 

where ya denotes the given values, which are assumed to be known 
over the range a;=a to «=&, 14. denotes the graduated values, m is a 
specified order of differences, and k is a parameter to be diosen by the 
graduator. The first term in the above expresdon is a measure of fidel¬ 
ity to the given data, while the second is a measure of smoothness, and 
the parameter k enables the graduator to assign whatever relative 
weight he desires to these two requirements. It turns out that the 
graduated values satisfy the difference equation: 

Ua -b (-l)’"fc8®“ti. = y„ (3) 

where S denotes a difference taken centrally. This equation holds for 
with the understanding that A"*Mx= 0 for a;<o and for x>h—m. 
Early methods of solving the difference equation (3) subject to these 
end conditions produce only approximations to the graduated values 
near the beguming of the sequence, or involve successive approxima¬ 
tion [14, 15], or else involve extensive computation [21]. Weaver [25] 
published in 1943 an abbreviated method of solution which gives the 
exact values directly. His method is based, in essence, on modem 
"short-cut” methods of solving simultaneous linear equations [8]. Let 

=* cy. -f- Ci,a-\Va-l -H Ct,a~^i^i + • • * + (4) 

= Co,»»» + Cl,aUaii — C 2 ,,«<e 4 * + • • * + (— 1 (5) 

wheredenotes a sequence of "partly graduated” values intermediate 
between u. and ya, the coefficient c of y. is a constant, and it is imder- 
stood that those “c” coefficients are zero which would be applied to 
"»’s” with subscripts less than a or to “u’s” with subscripts greater 
than b. Expanding the differences in equation (3) in terms of the u. 
values themselves gives a set of simultaneous equations connecting 
the “tt’s” and “y’s.” Another such set of equations is obtained by 
solving equation (5) for v. and substituting the resulting expression for 
all the "v’s” in equation (4). Equating coefficients in the two sets of 
equations gives a set of relationships between the "c” coefficients from 
which these coefficients can be succestively determined. The possibility 
of expresdng the relationship between the “w’s” and "y’s” in the form 
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of equations (4) and (5) depends on the fact that the matrix of the 
coefficients in the simultaneous equations is symmetrical about the 
principal diagonal. Spoerl [23] has previously described the application 
of this method to the solution of simultaneous equations derived from 
difference equations. 

Spoerl [22] has also generalized the difference equation method of 
graduation by considering the so-called "mixed dffieience case,” in 
which the expression to be minimized is taken as 

6 b-2 6-3 

E («. - y.)* + E + & E 

XBia xsma siaa 

where h and k are both arbitrary parameters. 

II. INTEBPOIA.TION 

SystematizaHon of osculatory interpolation formulas. One of the most 
common practical problems in interpolation is that which is sometimes 
called the problem of subtabulation. This is illustrated by the case in 
which values of the rate of mortality, or some other actuarial function 
dependmg on age, are given for every fifth or tenth year of age, and it 
is desired to supply the intermediate values for each single year of age. 
The traditional method of interpolation by means of a moving poly¬ 
nomial arc of degree k—1 determined by k consecutive given values 
gives rise to troublesome discontinuities when applied to empirical 
data. This long ago led actuaries to alter the conditions imposed on the 
polynomial arcs by decreasing the number of given points which the 
curve is required to pass through, but requiring the continuity of first 
derivatives, and, in some cases, hi^er derivatives, of the composite 
interpolation curve [29]. Groville [10] gave, in 1944, a systematization 
of the formulas of this type, which enables one to develop rapidly the 
formula, if any, satisfying given predetermined conditions as to span, 
orders of derivatives required to be continuous, degree of the poly¬ 
nomial arcs used, exactness to a specified order of differences, and 
reproduction of the given values. 

For example, it is shown that a linear compound formula for inter¬ 
polation in equal intervals by means of a composite interpolation curve 
made up of spliced polynomial arcs and having a continuous first 
derivative can be expressed in the form 

y»+a = [**(1 + 2a;)(^o ~ -Bi) + zBi — ^xzH^Bi\yn 
+ [**(1 + ~~ Bi) -h zBi — 

+ ») 


( 6 ) 
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where 0^®^], a denotes 1—®, S denotes a difference taken centraDy, 
Ao and Bt are Wte difference operators of the form 

Co "k Ci8* 4" cjS* 4* • • • , 

and P(ff, ®) is any linear compound of the “y’s” having polynomials in 
® as coefficients. Suppose it is desired to obtain a 4-tenn formula using 
third-degree polynomial arcs, correct to second differences and repro¬ 
ducing the given values. Since the pol 3 momial arcs are to be of the third 
d^ree, we must have P(y, x) =0. Reproduction of the given values 
means that .4.0=1; to have a 4r-term formula, Bi must be limited to the 
first term co only; and for correctness to second differences co must have 
the value unity. Substituting these values in equation (6) gives the 
well known Earup-Eing interpolation formula. 

On the other hand, if it should be desired to forego the requirement 
that the given values be reproduced but to require instead that the 
polynomial arcs be of the second degree, .do is of the form 1+kS^. 
In order for the polynomials to be of the second degree we must have 
4o=(14-i5*)jBi. Since Bi=l, that means that which gives 

y»+» 4- 4- (® 4" 

which is Jenkins’ second difference smoothing interpolation formula, 
a well-known formula which not only interpolates but, at the same 
time smooths the given data. 

Characteristic function, of an interpolation formula. Grenerolizing his 
previously mentioned study of graduation, Schoenberg [18] has also 
developed a characteristic function for interpolation formulas ex¬ 
pressible in linear compound form, so that the interpolatory properties 
of the formula can be inferred from certain properties of the character¬ 
istic function. Let the interpolation formula be 

Fix) = E yMx - v), 

C O 

where the numbers y, represent the given data, F(x) denotes the inter¬ 
polated value, and L(®) is called the “basic function” of the interpola¬ 
tion formula. The characteristic function* is defined as 

?(«) = J* L(®)e*'“d®. 


* A funotion if(tt) ddfizied in this ™*""^** in toms of snothor fiinotion L{x) is xofsExod to 

as the Fonrier tiansfoxm of Lix), 
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K, as is usually the case, L{x) is an even function, this may be 'written 
in the form 



L(x) cos uxdx. 


Schoenbei^ then shows that the interpolation formula is correct to 
fcth differences if g(u)—l has a zero of order k+1 at tt=-0 and, at the 
same time, g(u) has zeros of order k+1 for u=‘2vn, where n is any in¬ 
teger other than zero. He also shows that the formula reproduces the 
given values if and only if 

00 

2 + 2^) s 1. 


Incidentally, he further shows that the basic functions of the weU- 
kno'wn interpolation formulas can be expressed in a remarkably com¬ 
pact form through the use of Fourier integrals. For example, Jenkins’ 
well-known fifth-difference smoothing interpolation formula is given 
by: 


L(x) 


1 /• * /sin 


I cos 


Minimized difference interpolation. Beers [4] has suggested that, as a 
practical matter, in most actuarial and statistical applications the in¬ 
terpolator usually is interested in interpolated values only for certain 
discrete, equally spaced arguments. For example, in the case of actu¬ 
arial functions depending on age, it is usually only the values for in¬ 
tegral ages that are of importance. Accordingly, he argues that the 
consideration of continuous curves 'vidth continuous derivatives is iiv 
relevant and unnecessary, and proposes the development of tables of in¬ 
terpolation coefficients in linear compound form designed to minimize, 
on the average, the sum of the squares of a specified order of finite 
differences of the interpolated values, subject to stated requirements 
as to span, correctness to a spedfied order of differences, and reproduc¬ 
tion of given values. Thus, in his method the interpolated points are 
not thought of as lying on any particular polynomial or other curve, 
and no mention is made of derivatives or their continuity. 

For example, in the case of a 6-term formula correct to fourth differ¬ 
ences and reproducing the given values, if it is assumed that any two 
consecutive fifth differences of the given data are independently dis¬ 
tributed random variables with zero mean and equal -variances, the 
interpolation coefficients for subtabulation in fifths which minimize, on 
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TABLE 1 


INTERPOLATION COEFFICIENTS FOR BEERS* MINIMIZED FIFTH-DIFFERENCE 
FORMULA FOR SUBDIVIDING THE INTERPOLATION INTERVAL INTO FIFTHS 

PART A—TO BE USED FOR THE FIRST TWO INTERVALS 


CoefGlcients of Vx To Obtain: 

» - 

V.* V.4 V.a y.% Vi.t Vi.t j/i.f 1/1. a 


0. +.6667 +.4072 +.2148 +.0819 -.0404 -.0497 -.0389 -.0191 

1 . +.4969 +.8344 +1.0204 +1.0689 +.8404 +.6229 +.3849 +.1659 

2 . -.1426 -.2336 -.2456 -.1666 +.2344 +.6014 +.7534 +.9354 

3 . -.1006 -.0976 -.0636 -.0126 -.0216 -.0646 -.1006 -.0906 

4 . +.1079 +.1224 +.0884 +.0399 -.0196 -.0181 -.0041 +.0069 

6. -.0283 -.0328 -.0244 -.0115 +.0068 +.0081 +.0063 +.0016 


PART B—TO BE USED EXCEPT FOR THE FIRST TWO 
AND LAST TWO INTERVALS 


Coeffidents of Vx To Obtain: 


]/fH-.s Vn^A l/n+.« l/n+'t 


n-2. +.0117 +.0137 -.0087 +.0027 

n-1. -.0921 -.1101 +.0771 -.0311 

n. +.9234 +.7194 +.4454 +.1864 

n+1. +.1854 +.4454 +.7194 +.9234 

»+2. -.0311 -.0771 -.1101 -.0921 

n+3. +.0027 +.0087 +.0137 +.0117 


PART C—TO BE USED FOR THE LAST TWO INTERVALS 


Coefficients of Vx To Obtain: 


v»-.« 


«-6. +.0016 +.0053 +.0081 +.0068 -.0116 -.0244 -.0328 -.0283 

»-4. +.0069 -.0041 -.0181 -.0196 +.0399 +.0884 +.1224 +.1079 

*-3. -.0906 -.1006 -.0646 -.0216 -.0126 -.0636 -.0976 -.1006 

»-2. +.9364 +.7534 +.6014 +.2344 -.1666 -.2466 -.2336 -.1426 

»-l. +.1669 +.3849 +.6229 +.8404 +1.0689 +1.0204 +.8344 +.4969 

*. -.0191 -.0380 -.0497 -.0404 +.0819 +.2148 +.4072 +.6667 


the average, the sum of the squares of the fifth differences of the inter¬ 
polated values are those given in Table I. While, on theoretical grounds, 
a fairly hi^ negative correlation between successive fifth differences 
would be expected, it will commonly be found in practice that the 
actual behavior of the fifth differences of empirical data approximates 
closely enou^ to the assumptions stated so that Beers’ formula sdelds 
plausible results with a highly satisfactory degree of smoothness. 

Tables of interpolation coefidcients deduced on the same or similar 
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principles have been published by Beers [4, 6, 6], Greville [11], and 
Wells [26]. 

Relationship of interpolation to summation graduation. Vaughan [24] 
has shown that there is an intimate connection between linear com¬ 
pound subtabulation and the “summation” formulas of graduation 
which have received much attention from English actuaries. The latter 
are a special type of linear compoxmd graduation formula, in which the 
relationship between the graduated and ungraduated value is of the 
form: 


Ux = 


[p][g][r] • • • 




pgr • • • 


where are positive odd integers, [p] is an operator denoting 

the summation of p consecutive given values ssrmmetrically situated 
about the value to which the operation is applied, and /(i) is an 
“operand” of the form l+Ci8*+csS*4- * • •, which is usually chosen 
so as to make the graduation formula correct to a specified order of 
differences. This typo of formula was of greater interest and impor¬ 
tance before computing machine came into general use, as the use of 
summations in the formulas then resulted in a substantial short cut 
in computation. 

Let a subtabulation formula for dividing the intervals between given 
data into k parts be 

00 

r»-ao 


Then, there corresponds uniquely to this interpolation formula the 
graduation formula 

- -T £ 
k 

Vau^ban shows that the latter formula can be expressed in the form 

I7x = 

where n is the order of differences to which the interpolation formula is 
correct. Moreover, the terms of /(5) as far as 8* must be such that the 
entire graduation formula is correct to nth differences. As the number 
of possible terms in /(8) is limited by the span of the interpolation 
formula, this leaves, in general, only a small number of undetermined 
coefficients. This principle therefore provides an alternative method of 
derivation of interpolation coefficients satisfying such criteria as Beers 
has proposed. 
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Interpolation by applying a least square criterion over the entire range. 
Vaughan [24] and Greville [12] have considered independently a some¬ 
what different approach to the subtabulation problem. In this ap¬ 
proach, instead of expressing each interpolated value in terms of a few 
neighboring values, the object is "to ascertain definitely the inter¬ 
polated terms which will have over the whole series the smallest possi¬ 
ble sum of squared differences of any ^ven order” [24]. It is shown 
that in this case the given and interpolated values are connected by the 
relation: 




E-*F{E) 

fm 


Vrk, 


(7) 


where h is the interval of subtabulation, E denotes the displacement 
operator defined by Ey»=ya+u 


F{E) = 


(jgwa _ j^knym 
(J51/2 _ 


( 8 ) 


(m being the order of differences to be minimized), and /(jB*) is an 
expression consisting of only those terms in the expanded quotient of 
formula (8) in which the exponent of ^ is a multiple of h. The solution 
of the difference equation (7) is facilitated by iutroducing an intermedi¬ 
ate sequence Va which vanishes except when a; is an integral multiple 
of h, such that 

= E-’>F{E)u,k, 


and 

Wo,* = y,*. (9) 

The difference equation (9) may be solved by a method analogous to 
that of Weaver previously described, or by expanding the function 
l//(0 in a power series of the form: Oo+Oi(<+f"‘)+fl 2 (i*+f^)+ • • •. 

IrUerpolation by “quadratic cross-niLans.” Another type of practical 
interpolation problem is that in which only a single isolated value, or 
only a few such values, are desired. Probably the best method of es¬ 
timating such values for empirical data is the classical one of fitting a 
polynomial of d^ree k to k+1 given values. One of the most effective 
methods of carrying out the computation for such interpolation is the 
iterative process given in 1932 by Aitken [2], which has recently been 
improved by Fdder [9], who demonstrates Aitken’s basic process in the 
following manner. Let it be required to compute the value fiQ of a 
polynomial of the nth d^ree,/(®), given/*=/(®*) for Jb=-0,1, • • •, n. 
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fn Xo f 

We define /«>(aj) ^ (® - * 0 ) 

/(*) ® - I 


and note that it is a polynomial of degree »—1, and that/<“({) =/(f). 
Hence we are required to find knowing 


/*{!) = 


/o a:o - { 
fk Xk- ^ 


■i- (®i - ®o) 


for 2, • ■ •, n. Thus the problem has been reduced to a similar 
problem in which the number of given values is one less, and in like 
manner it is further reduced to n—2, and so on. 

The improvement introduced by Feller arises in interpolating by 
"quadratic cross-means.” The latter process, which origiiuiJly appeared 
to work only for an even number of data, can now be equally well 
adapted to an odd number. However, it is still necessary that the given 
data be S 3 nnmetrically placed with respect to some i>oint x—a. The es¬ 
sential feature of the method consists in regarding the interpolation 
polynomial as a polynomial of lower degree in the sqmre of the inde¬ 
pendent variable. This device reduces by about one-half the number 
of steps in the iterative process. Denote two symmetrically placed 
points by Xk and x-k. Tho point is included among the ^ta only 
if « is even. Consider 


^(x) = 


/(2o - X) 

/(*) 


2a — » — i 

X — i 


- 5 - 2(» — o) 


^(») 


— /(2o — x) 2o — X — J 
/(x) X - J 


■ 4 - 2(x — o). 


Obviously ^(x) and if'(x) are even functions of x—a, and hence poly¬ 
nomials in f=(x—a)*. Moreover, 

If n==2»»—1, tho problem is reduced to finding the value for 
(f—o)® of the polynomial of (m—1) th degree ^(o-i-v/f)» given its values 


<^k 


(Xk — x^) 


/_* x_» — 5 
fk Xk - i 

forf=(x*—o)*,fc=l,2, •••,«». 

On the other hand if »=2im, we compute the values 

—/_fc x_» — S 
fk Xk-i 


4'k — 


-4- 2({ - o) 


for <= (x*—o)*, fc=0,1, ■ • •, m, and proceed as before. 
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SAMPLING ERRORS IN MORTALITY AND OTHER 
STATISTICS IN LIFE INSURANCE* 

D 0 NAI 4 D D. Cody 

The Equitable Life Assurance Society of the United States 

The types, sources, and extent of data entering into the ana¬ 
lytical statistics in life insurance companies are outlined. The 
statistical errors in mortality investigations arising as a result 
of the limited extent of the death claim experience or arising 
from the use of exposure sampling techniques are discussed. 
Comments arc also made about the experience rating problem 
in group insurance and about a possible application of sequen¬ 
tial analysis to the checking of files. 

TYPES, SOTIRCES, AND EXTENT OP DATA 

i CTUAMES are concerned -with a number of different tsrpes of statis- 
J\_ tical rates and probabilities. The most important of these are the 
mortality rate, the rate of occurrence of permanent disability, the rate 
of termination of such disability by recovery or death, the accidental 
death rate, the rate of claim \mder accident and health policies, the rate 
of claim under hospitalization and medical reimbursement policies, and 
the rate of termination of policies. The somrees of data for making es¬ 
timates of these rates and probabilities are the company’s files, which, 
in some cases, are designed primarily for furnishing such data, and in 
other cases, are designed primarily for other purposes. A description of 
certain files from which data for life insurance mortality investigations 
are taken in the Equitable Life Assurance Society will serve as an il¬ 
lustration. 

The basic statistical data regarding death for a life insurance mor¬ 
tality investigation come almost exclusively from the death claim file, 
which is a file of I.B.M. punched cards prepared from the claim rec¬ 
ords filed when a policy becomes payable by death. A specimen card is 
diown in Eig. 1. Each calendar year some 16,000 to 18,000 of these cards 
are added to these files. 

The primary source of information concerning the exposure to the 
risk of death in lai^e oveivall mortality investigations of standard 
medically examined business comes from another file of I.B.M. punched 
cards wldch is used primarily for the annual valuation of the company’s 
reserve liability. This file is summarized mechanically to give the num¬ 
bers and face amounts of insurance on policies in force on December 31 

* A paper presented at the 107th Annual Meeting of the American Statistioal Afieodalion, New 
York City* Deoember 30* 1047. 
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FIGURE 1 

DEATH CLAIM DETAIL CARD 


of each calendar year; these are classified according to rating group, 
plan of insurance, calendar year of issue, and age at issue. The data 
from this file must be adjusted to differentiate policies mth some spe¬ 
cial attribute which makes them non-homogeneous with the bulk of 
the policies, e.g., policies issued without a medical examination. There 
are now about 2,000,000 cards in the valuation detail card file of the 
Equitable, and the file is increased by a net addition of 50,000 to 
100,000 cards each year. However, the sununaiy cards (see Kg. 2 for 
a specimen) actually used number less than 100,000. 
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It is also necessary to keep a special mortality class file of I.B.M. 
punched cards, which are made up from application papers at issue, in 
the case of special policies such as those involving medical impairments 
and occupational hazards (sec Fig. 3). There are about 1,250,000 cards 
in these files, representing all policies issued since 1925 to policyholders 
with such impairments or occupations. This may seem like a rather 
large number of cards, but since about 400 medical impairment classes 
and 250 occupational classes have been studied in the last twenty years, 
the number of exposures and the number of deaths in some of the classes 
are very small. 
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FIGURE 3 

MEDICAL AND OCCUPATIONAL STUDY DETAIL CARD 

These files are so constituted that data on the main medically exam¬ 
ined standard business and on all the special classifications can be com¬ 
bined with such data from the larger life insurance companies to give 
over-all intercompany experience. This pooling of data is made, an¬ 
alyzed, and published by the Joint Committee on Mortality of the 
Actuarial Society of America and the Association of Life Insurance 
Medical Directors. 

THE ANALYSIS OP MORTALITY AND OTHER STATISTICS 

(a) Sampling errors in death claims. The primary problem facing the 
actuary is the efficient estimation of the true underlying probabilities 
from the statistics available. With respect to the large bulk of standard 
medically examined business, such large numbers are involved that 
sampling errors in the deaths themselves at each age and policy year 
are relatively small. The only serious complication arises from the fact 
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that the primary interest is in the financial effect of claims; it is cus¬ 
tomary, therefore, to analyse the mortality rates for the most part in 
terms of the amounts of insurance exposed and the amounts of claims 
paid. However, it is a relatively simple matter to take into account the 
effects of variation in the amounts of insurance upon the standard 
deviations used to test the reliability of the statistics [1]. 

For instance, the standard deviation in the mortality rate by 
amounts, defined as the ratio of the total amoimt of death claims in 
dollars, over the total amount of insurance exposed in dollars at age x is 



where 




A / 






g, = mortality rate by amounts = 

8 = amount of insurance in dollars on individual life (mid¬ 
point of class interval) 

= number of lives exposed with policies of s dollars 

=» E 

« 

E»fc 

« 

o', = total actual death claims in dollars. 

A t 3 rpieal size for R, is 2. 

The process of graduating a crude series of values of the rate of mor¬ 
tality has the effect of reducing the variance of the individual rates of 
mortality, but this process is so thoroughly described by T.N.E. 
Greville elsewhere in this issue that mere reference is sufficient here [2]. 

On the other hand, for the investigation of the death rates in special 
mortality classifications, there are important problems involving the 
design of categories in which the data are analysed, the combination of 
data within a number of independent categories into relatively homo¬ 
geneous groupings large enough to give statistically reliable results, and 
the deteimination of reliable measures of dispersion: on these problems, 
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only slow progress is being made. We are proceeding gradually in the 
redesign of the classifications themselves as a result of our growing 
knowledge of the salient attributes of each impairment or occupation. 
The redesign probably can most efficiently be carried out by a working 
group of a doctor to supply the necessary clinical infoimation, an under¬ 
writer to supply information concerning the acceptance of each risk, 
an actuary to supply information concerning the monetary effects, and 
a statistician to provide a proper appreciation for homogeneous and 
significant categorization in the light of previous statistics. 

(b) SamjMng errors due to exposure sampling. So far, the discussion 
has only been concerned with sampling errors in the estimation pro¬ 
cedures arising from the limited size of the experience from death 
claims. In some instances, because of the cost or time involved in 
bringing up to date a file needed for determining exposures required in 
some particular estimation problem, at the present time consideration 
is bdng given to taking, say, a 10 per cent sample of all policies issued 
in a particular period by pulling out record cards with policy numbers 
ending in a particular digit. This sample will be studied for the point 
under investigation, and the records characterized by this point will be 
broken down into the necessary number of smaller relatively homoge¬ 
neous categories, thus giving the exposure to risk of death in each such 
cat^ory. These exposures will then be combined with a 100 per cent 
sample of the corresponding death claims from the death claims file; 
the mortality rate is then the ratio of death claims to the exposure 
estimated from the sample, viz.. 


e 



where mortality rate 

deaths 

total exposure as estimated from the sample exposure 

m 

100 8« percentage sample used in determining exposure 

In the case of an investigation with huge exposures in each category 
with lives used as units Cin lieu of amoimt units), the variance of the 
mortality rate determined by such a procedure is larger than the vari¬ 
ance in the mortality rate determined from a 100 per cent exposure 
sample, in approximately the ratio 


1 + 


(1 - «)g 

8(1 - q) ' 
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In the event that q involves a number of ages, and the exposure is taken 
over a period of years so that each life may enter into the exposure a 
number of times at different ages, this ratio is increased by a serial cor¬ 
relation effect to the approximate value 


1 4 


m - s)g 

s(l - q) 


where J is a type of wwghted average over-all mortality rate and 
r is an average period of exposure per life in the category. 

The introduction of amounts of claim for the purpose of estimating the 
mortality rate by amormts further complicates the problem. This gen¬ 
eral problem may be found discussed in the TransacHona of the Actu¬ 
arial Society of America [3, 4]. 


THE EXPEDIENCE BATING PBOBLEM IN GBOUP INSTTBANCE 


A very important type of insurance policy today is the group policy, 
which insures under a single contract the health, retirement, and death 
benefit needs of all employees in certain employment categories of a 
single employer. Under this type of coverage there is a very complicated 
statistical estimation problem involved in detenoining the claim 
charges (credits in the case of annuities) to be assessed against each 
employer in the annual not cost. In order to establish proper equities 
and to assure relatively stable cost to the employer from year to year, 
it is necessary to base these claim charges partially on each employer’s 
own experience and partially on the over-all experience in the particu¬ 
lar premium group under which the employer’s contract is classified. 
Naturally, the larger the number of employees and the older the em¬ 
ployer’s contract, the more weight can be pven to the employer’s own 
experience. 

One type of formula used in the case of group life insurance for the 
claim diarge against a particular employer is the following; 


P 

A+P 


K-[ 


A 

A+P 


kP 


where P=total premiums received from the employer during a stipu¬ 
lated period of time 

K =actual claims in the employer’s group during the same period 
k => ratio of total death claims to total premiums over the whole 
giDup life insurance experience for the same period 
A =constant chosen usually by practical considerations 
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This function in the case of K and P measured on a ^ves” basis has 
been determined theoretically by simple probability considerations by 
R. Rcffer [5]. Mr. Keffer assumes that the ratios, r, of the mortality 
rates for each group to the mortality rates for all groups combined are 
independent of age and are distributed according to a Pearson Type 
III curve as follows: 

(m + + l)r]" 


where m is a parameter related to ri in the previous paragraph. Since 
the probability of death is small, the Poisson Law gives the probability 
of esperiencing d deaths in a group governed by a ratio r as 

e-”(rc)'* 

d! 

where c is the expected number of deaths based on the experience of all 
groups. Therefore, the probability of the ratio being of value r and of 
there being d deaths is 

(m + l)"+‘c‘'(m + d)l e-(”+i+«)*'[(TO + 1 + c)r]”*'‘‘* 

f/z s= -- - - - ---- —---- --- - • 

(m + 1 + c)”+‘*m!d! (m + d)! 

Theoretically m could be determined by fitting this function to the com¬ 
pany experience, although the resulting value might prove impractical. 
The mean value of r is then foimd from this to be 

wi -j- 1 -f" d 

f = 

m -|- I + c 

so that the mean value of the deaths in the group is 

m + 1 c 

fc --c _!- 

OT-l-l-l-C 

which may be used as an estimate of the average number of deaths 
properly assignable to the group for cost purposes. If we sot m+1= Ah, 
c—hP, d^K on the assumption that each life is insured for unity, 
then 

A , P 

fc = --- kP + - -— K as set forth above. 

A+P A+P 

There is a need, however, for a reconsideration of the theory of this 
problem, taking into account distributions of exposures and deaths by 
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age and amounts of insurance in the whole premium classification and 
in the individual group. Functions of a stochastic variable in the 
theory of risk as developed by Professor Cram6r and others of the 
Scandinavian actuarial school may prove to be useful tools in obtaining 
insight into this problem [6]. It should also be mentioned that the 
casualty insurance actuaries have given considerable attention to this 
general problem [7, 8, 9]. 

THE CHECKING OF FILES 

An interesting problem exists in connection with the checking of 
such files as the special mortality class files from which basic statistical 
data are drawn. It is desirable to have some trustworthy means of con¬ 
firming the accuracy of the files without checking the whole back 
against ponderous original records. We are concerned with errors of 
various levels of importance, such as, for instance, the entries of deaths 
and withdrawals. We are currently examining the feasibility of setting 
up acceptance criteria to be applied by a binomial sequential anal 3 rsis 
procedure [10] to a special mortality class file of some 500,000 I.B.M. 
punched cards. 

We have in mind sorting out sequential samples in groups consisting 
of ail cards in a particular impairment group characterized by an im¬ 
pairment code punched in the cards. Such a sample will involve cards 
into which entries of transactions diould have been made over the 
period involved in our investigations. Each card will be compared 
with basic records with respect to all transactions such as, for instance, 
death and withdrawal, and a tabulation will be kept of the number of 
cards on which each t 3 rpc of transaction should have been made and the 
munber of errors found in each type of transaction. In the case of 
death tranhactions, for instance, we feel that we can sustain an error 
of the order of 2.5 per cent of all death transactions without seriously 
biasing our conclusions, bearing in mind that sampling errors exist in 
any case by reason of the limited extent of our data. Our criteria might 
then consist of keeping below 2 per cent the probability of deciding 
the file needs complete rcchecking when the actual errors are below 
1 per cent and of keeping below 1 per cent the probability of accepting 
the file as satisfactory when the actual errors are above 5 per cent. The 
average sample number is below 300 for these particular criteria, so 
that probably no more than 600 transactions would have to be inves¬ 
tigated before the test procedure terminated in the acceptance or re¬ 
jection of the fiile. Depending on the percentage of death transactions 
among the cards sampled, which is largely a function of the period of 
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years covered, we would hope to investigate no more than 10,000 to 
20,000 cards out of the 500,000 cards. K the file does possess the desired 
degree of accuracy, this procedure appears to offer an economical 
means of confirming the fact. Of course, if too many errors are present 
in the fiGie, an extensive effort is indicated to correct it. 
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THE EDGE-MARKING METHOD OF ANALYZING DATA 

L. L. Thubbtonb 
The University of Chicago 

About thirty years ago the writer introduced a simple card 
method of analyzing data. The method has been used by many 
students and others. It is especially adapted to problems with 
experimental populations of not more than about four hundred 
eases in which it is desired to analyze the relations of a large 
number of attributes. The number of attributes can be rather 
extensive, up to three hundred if necessary. The writeris 
6 by 8 cards for edge-marking have been reprinted many times. 
Most of the users of those cards are familiar with only one or 
two applications. The purpose of this paper is to describe 
enough examples to illustrate the versatility of the edge-mark¬ 
ing method. 

O BiGiNAiiLY the provocation for devising the method was the fre¬ 
quent necessity for analyzing data on populations of several 
hundred subjects in situations where the pimched card methods were 
not available or where they were too expensive. This was often the 
case with students who could not afford the expense of punched card 
methods and in which it was often questionable whether the size of the 
problem would justify the punched card procedures. When the number 
of attributes exceeded the number of columns on the punched card, it 
was necessary to resort to double punching with consequent complica- 
. tions in handling the analysis. Methods have been available for punch¬ 
ing holes or slots in the edges of cards. These methods enable one to 
handle* a large number of variables and they are often very useful. 
Special equipment is necessary for their use which is often not avail¬ 
able for students who work on problems of limited size. The card 
method that wo have boon using increases still further the number of 
attributes that can be recorded on each card because we use pencil 
marks instead of holes along the edges of the card. This makes possible 
the use of both sides of the card for different attributes. The total 
edge length that is available is therefore twice the perimeter of the card. 
An\)rdinary 6^l>y 8 card provides 52 inches of edge length along which 
to record data about the individual who is represented by the card. Sta¬ 
tical problems can be handled by a student alone without special 
equipment beyond the printed cards. For many years the edge-marking 
method has been known as the ‘‘slant-cut” card system, but the slant- 
cut on one corner of the card has only incidental reference to the meth¬ 
od of statistical analysis with these cards. The cut in one comer of the 
card serves the same purpose as the comeivcut on the usual I.B.M. 
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punched cards, namely to make conspicuous any card that has been 
inadvertently turned over. 

In Fig. 1 we have a diagram of the 5 by 8 card. Each card is first 
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identified with the name of the individual subject who is represented 
by the card. Around the edges of this card there are 130 short lines, 
66 on each side of the card. Each line extends to within a short distance 
of the edge of the card, and each line is identified with a number. A 
master list is prepared showing the attribute that is represented by 
each code number. It is often desirable to represent the presence or 
absence of each attribute. This is usually done by black and red pencil 
marks which continue the printed line to the edge of the card. When 
information is lacking about an attribute, the corresponding space is 
left blank. Accentuation of the attribute can be represented by a double 
red line, or a double black line, at the appropriate space on the edge of 
the card. These ideas can be varied indefinitely to suit particular 
problems. We have found that it is not desirable to use many colors. 
We ordinarily limit the use of color to red and green together with 
black. In some problems the several gradations, or class-intervals of a 
variable, can be represented by several colors. In other cases, where 
there is plenty of room on the card and the number of variables is not 
large, several spaces are provided for the different gradations or class- 
intervals of each variable. Again, these decisions are adapted to each 
problem. 

Perhaps the most common application of the edge-marking method is 
in the preparation of four-fold tables for a large number of attributes. 
As an example, suppose that each attribute has been given a code 
number and that the presence of an attribute is denoted with a red 
mark while the absence of the attribute is denoted with a black mark 
at the edge of the card. For this example we shall assume that only the 
code numbers 23 to 34 inclusive have been used in the positions shown 
at the top of the card in Fig. 1. 

When a card has been prepared for each subject in the experimental 
population, let us start with the tabulation of all four-fold tables that 
involve attribute 23. The cards are stacked and then fanned so as to 
expose the edges as shown in Fig. 2. The cards arc then separated ac¬ 
cording to Ihe edge marks in the position 23. All cards with black marks 
at this position arc sorted into one pile, and all cards with red marks are 
sorted into another pile. We shall assume here that there are no blank 
entries. Each of the two piles is then fanned so as to expose the edges 
as shown in Fig. 2. The cards that represent negative answers to item 
23 will show a sold black line at position 23. The other stack of cards 
to the right will show a solid red line in the same position. In the figure 
we arousing cross hatching to represent the red color. The cards with 
the solid rod line at 23 represent the individuals with positive answers 
to item 23. 
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FIGURE 2 

The two piles are separated slightly in stacking them so as to make 
the code niunbers visible for both piles as shown. The figure extends 
only to item 34. A data sheet is prepared as shown on the right side of 
the figure. This table is prepared for recording the four cell frequencies 
for each four-fold table. The first column shows the number of black 
marks in each row of the first pile. The second column shows the num¬ 
ber of red marks in each row of the first pile. The third colunm shows the 
number of black marks in eadi row of the second pile, and the fourth 
column shows the number of red marks in each row of the second pile. 
Consider a particular example as shown in the four-fold table at the 
botton of the figure. This table shows the association between the at¬ 
tributes 23 and 24. The association is evidently negative in that those 
who have attribute 23 tend to be lacking in attribute 24. This fact can 
also be seen by inaction of the row of edge marks at the level 24. It is 
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immediately apparent that the proportion of black marks is larger for 
the second pile than for the first pile at the level 24. Since there are as 
many as twenty marks on a single edge of the card, it is evident that 
one can inspect the relations of attribute 23 with as many as twenty 
other attributes in one position of the cards. Twenty four-fold tables 
can be prepared from this position of the cards. 

When the cards have been inspected, or counted, for all the attri¬ 
butes on one edge, 1b.e two piles can be turned to each of the other 
edges and the process can be continued until all of the four-fold tables 
have been prepared for attribute 23 which determined the separation 
of the cards into two piles. There are eight e<^es to be thus examined, 
four on each face of the cards. The cards can then be sorted into two 
piles according to each other variable that is to be analyzed. 

Sometimes it is desired only to inspect the cards without countii^ 
the actual frequencies. After separating the cards into two piles accord¬ 
ing to presence or absence of a particular attribute such as a criterion 
measure, one may inspect the cards in search for those other attributes 
that correlate with the criterion measure. If the proportion of red and 
black marks is about the same in the two piles, one may skip the com¬ 
parison without bothering to count the actual frequencies. If we do 
that for the example of Fig. 2, we find, for example, that attribute 26 
has some relation to 23 even before coxmting the red and black marks. 
When we come to 28, we see that this attribute has a striking relation 
with 23 because there are many black marks on the left but only one on 
the right. Similarly, item 34 shows some relation to 23 because of the 
relatively greater density of black marks on the right. This example 
illustrates the use of the edge-marking method for inspectional purposes 
whore only some of the frequencies are actually counted. In attaching 
significance to riie relations found, it must of course be remembered that 
significance is determined in part by the total number of relations that 
are inspected. 

In the present example we have assumed that there were no blank 
entries on the data to be recorded. If the original data contain blanks 
for some of the items, these can be represented by merely omitting the 
edge mark for the items in question. In some problems, it is of interest 
to know which questions draw blanks in the answers, and sometimes 
rixe blanks are considered to be significant as evasions in response to 
some questions. The blanks can be inspected as to relative frequency in 
the two groups of subjects that are being compared. When the cards 
are stacked as rirown in Fig. 2, the relative frequencies of the blanks in 
the two piles can be inspected and coxmted if they are significantly 
different. 
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We shall list here a niimber of t]rpical uses of the edge-marking meth¬ 
od. 

1 . The preparation of four-fold tables for any number of pairs of 
variables. These can be prepared in a fraction of the time that 
would be required by ordinary methods of tabulation and 
counting. 

2. The preparation of tables of multiple classification. These are 
the cases in which the variables are classified into more than two 
categories or class-intervals for the calculation of Ohi square and 
other indices. 

3. The inspection of data for questionnaires to discover significant 
association between variables without counting all of the fre¬ 
quencies. 

4. The item analysis of questionnaires, psychological and educa¬ 
tional tests, application blanks, and other schedules against some 
criterion or dependent variable. 

5. The reduction of a large number of attributes or variables to a 
limited nmnber of groups or clusters. This procedure is especially 
useful when the problem is to determine the dimensionality of a 
domain that is represented by a large number of discrete items. 
The composite scores for the clusters can then be studied by 
factorial methods. 

6 . The calculation of composite scores for a group of attributes. 
Each individual score may be the number of attributes in a group 
of attributes which are positive in that individual. This calcula¬ 
tion is done with a scoring frame to be described. 

7. The scoring of an individual performance when the subject 
classifies or sorts cards. In a factorial study of perception the 
writer included several tests of the optical illusions.^ These were 
arranged on cards which the subject was asked to sort to the right 
or to the left according to the apparent magnitudes in the dia¬ 
grams on the cards. The scoring was greatly simplified by edge¬ 
marking. The examiner merely picked up the stack of caids 
which the subject had placed in the ri^t-hand compartment 
and turned them over. The number of right responses was 
counted by the edge marks that were on the backs of the cards 
so that they were not visible to the subject. 

When the examiner turned over the cards, he arranged them 
so as to espose the edges as shown in Fig. 2. He placed the stack 

1 L. L. Thuntonet A Factorial Study of Pereeption, The Univeisity of Chicago Press, Chicago, 
1M4, tests 27-31, induslTe. 
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of cards on a data sheet as shown at the right of the figure. A 
frequency distribution was then made by counting the number 
of edge marks in each row. This procedure gave immediately the 
distribution of magnitudes because each card was edge-marked 
on the back side at a position which designated the magnitude 
of the right-hand figure in the optical illusion on that card. The 
cards were then shuffled for the next subject. 

This work of scoring was done in a fraction of the time that 
would have been required by ordinary methods of tabulation. 
The Mionesota Multiphasic test has also been scored in this 
manner with edge marks on the backs of the cards. These are 
made by the examiner to simplify his work, but the edge marks 
could be printed on the cards by the manufacturer. 

8 . Bibliographic cards can be edge-marked to very great advan¬ 
tage. In every field for which a bibliography is assembled, there 
are a number of categories or subheadings. Each journal article 
or monograph is represented by a card. Instead of preparing 
duplicate cards for cross reference purposes, one card can be 
edge-marked on the back, or on the front, to show the several 
categories for which the article is relevant. If it is desirable to 
collect all the references in a particular category, one need only 
assemble all of the cards that have an edge mark in a particular 
space. When the compilation has been completed, the cards can 
be re-sorted into the general alphabetical file. This method 
saves the labor of duplicating cards or of writiug explicit cross 
references for each card. When bibliographic cards are prepared 
and used by an investigator, it is often a matter of considerable 
importance to reduce the clerical labor to a m inimum. A S 3 rstem 
that is crimbcrsomc and which requires much writing is likely to 
be discontinued. 

9. A practical application of the same kind can be made on a mail¬ 
ing list. If every person on the mulling list is represented by a 
separate card, it can be edgo-marked to indicate the categories 
to which the name belongs and the kinds of material that should 
be mailed to that person. The fact that certain material has 
actually been mailed to a group of individuals can be indicated 
by edge-marking a stack of cards with a single pencil stroke at 
the appropriate space on the cards when they are fanned so as 
to expose all of the edges. The mailing of different types of ma¬ 
terial can be controlled from one master card file. These methods 
are not necessarily the most appropriate in large offices. 
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10. The filing of library cards and the checking of the file could be 
facilitated by edge -ma rkin g on the cards. The library classifica¬ 
tion could be represented by a series of marks on the edges of the 
caids so that the marks would be visible when the cards are 
stacked tight in a drawer. Any card that is misplaced would be 
immediately seen because its markings would not be continuous 
with those of adjacent cards. In this case it should not be neces¬ 
sary to fan the cards because the marks could be printed on the 
edges when the cards are stacked. This could be done by the 
printer when the cards are manufactured. The writer has sug¬ 
gested this application to librarians but the method evidently 
has not yet been tried. 

11 . If it is desired to present a set of cards in a uniform order to all 
subjects, the desired order can be represented by edge-marking 
either on the front or back of the card. When each subject has 
finished his categorization of the cards, they can be re-sorted by 
means of the edge-marking into the original order for the next 
subject. The alphabetized order can be represented by edge- 
marking because the edge-marking is perceived more quickly 
than the aphabetical order. 

12. When paired comparison data are to be tabulated, the proportion 
of ju(^ments, j>h,m which j is preferred to k, can be facilitated 
by edge-marking. This adaptation can be made in several ways 
depending on the original form of the tabulations. 

13. A test form or preference schedule can be so arranged that the 
subject’s responses are made at the edges of a card, the central 
portion of which is devoted to the text. The subject’s responses 
can then be analyzed by the edge marks which he has made him¬ 
self. This procedure saves ad^tional time because the edge¬ 
marking is then already made by each subject. 

For some types of statistical problems we have used a rack for hold¬ 
ing the cards. This application can be illustrated by an example in 
which we used the larger card, namely, 10| by 12 inches. This was the 
laigest size of card that could be filed in a vertical filing case of standard 
dimensions. In using the large size card it is best to have a rack on which 
to place the cards for the kind of comparison that was described with 
Fig. 2. The rack consists of a board that is moimted at an angle of 
about sixty degrees with the surface of the table. At the lower edge of 
the board is a ledge on which the cards rest. Two or more groups of 
cards can be arranged on tibis board for inspection of the edge marks. 
When using the rack for countug frequencies, especial]^ on the large 
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cards, the work is facilitated by using a long flexible ruler or preferably 
a horizontal irule that can be clamped in position for each row when the 
frequencies are being counted or inspected. 

The use of the scoring frame will be described in terms of a recent 
study in the Psychometric Laboratory at The University of Chicago. 
A schodule of 330 questions had been assembled on temperamental 
characteristics. The subject was asked to respond to each question with 
one of three responses, namely, yes, no, or a question mark. The 330 
responses were edge-marked on a single card for each subject. The 
assumption was made that the large collection of questions represented 
a much smaller number of factors, dimensions, or types. The purpose 
of the study was to make a factorial investigation of the dim ens ionality 
of the domain that was represented by the entire list of questions. For 
this piurpose, it was desirable to discover first the main groupings of the 
items and to make the grouping in such a manner that all of the items 
in each group should be as closely related as posdble. 

The items were first arranged in tentative groups according to pre¬ 
liminary interpretations. Then we wanted to as^gn to eadi subject a 
composite score for each group of items. For example, one group of 
items was concerned with rate and amoimt of activity. A composite 
score for such a group of items would be the subject's self-rating on 
general activity. A very active person would be expected to have a high 
composite score while a calm and deliberate or phlegmatic person would 
be expected to have a lower rating on this characteristic. Such a com¬ 
posite rating does not imply that the trait will be revealed as a primary 
factor. It mi^t turn out to be a combination of other more general or 
fundamental factors. Before such a composite rating can be accepted 
even for preliminary study, we should have some assurance that the 
several individual items in each group are highly correlated with the 
composite score for the group. It was for this type of analysis that we 
wanted a composite score for each subject in each group. Those items 
which did not show close relation with the composite score were to be 
eliminated from the group and other items woifld be added by inspec¬ 
tion of the edge marks. 

The composite scoring frame is represented diagrammatically in 
Fig. 3. For simplicity of explanation, we shall assume that all of the 
items in the group to be analyzed are represented on the upper and 
ri^t-hand edges of the front face of the edge-marldng card. The scon- 
ing frame has the four sides A which enclose a rectangular space equal 
to that of the edge-marking card. One of tire individual cards is placed 
into this rectangular space. The upper and ri^t-hand edges of this 
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card are shown at £ in Fig. 3. A master card C is prepared for each 
group of items. This card is slightly smaller than the edge-marking 
card. The master card in the figure has been moved in the direction D, 
toward the lower left comer of the frame. When the master card is 
moved toward the lower left comer of the frame, the edge marks be¬ 
come visible along the upper and r^t-hand edges as shown in the 
two edges B. Let the cross hatched marks represent red edge marks on 
the individual card. 

Let us suppose that nine items have been grouped in the belief that 
they are closely related. These nine items are represented by edge 
marks on the master card C. Bed and black edge marks have been made 
on the master card according to whether positive or negative responses 
to the items represent presence or absence of the postulated general 
trait. When the master card is in the position shown in Fig. 3, one can 
readily count the number of items on which the subject responded in 
the expected manner if he excelled in the group trait that is represented 
by the master card. The composite score for the individual subject is 
the number of items on which he responded in a manner similar to that 
which is marked on the master card. In the example of Fig. 3 we find 
that the individual subject agreed with the master card on seven of the 
nine questions. On the top edge, for example, we find that the subject 
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made two positive responses and one negative response that agreed 
with the master card. The responses at F and H agree with the master 
card while the response at 6 disagrees with the master card. The slant- 
cut at J? is also made on the master card so as to insure that the master 
card and the individual card are properly matched. 

When the count for the composite score has been made, the result 
is recorded on the individual card. The arrow K on the master card 
indicates where the composite score for this group of items should be 
recorded on the individual card. The composite score 7 is now recorded 
on the individual card. The same procedure can be followed for com¬ 
posite scores on other groups of questions or items. The composite 
scores for different groups are, of course, recorded in different positions 
on the individual card. WTien all of the composite scores have been 
recorded on the edge-marking cards, these can be tabulated in a fre¬ 
quency distribution. The median composite score is then determined, 
llie cards can then be marked with red and black to indicate whether 
the composite score for an individual is above or below the median or 
mean. The cards are then ready for the preparation of four-fold tables 
to show the relation between the individual item responses and the 
composite score for each group. Similar analyses can be made for 
quartiles or for any other groups. Those items that do not agree with 
the composite can be eliminated and other items can be discovered by 
inspection that should be included instead. 

In the present example, we have assumed that all of the items in a 
group were on the two edges of the card which are simultaneously visi¬ 
ble. If the items spread over more than the two edges, the master card 
is moved in the direction E. That exposes the edge marks along the left 
edge and the bottom edge of tlic card. If the group items extend over 
both faces of the canl, then the individual card and the master card 
are turned over so that the coimting can be continued on the second face 
of the card in the same manner. The master card is then also marked on 
both faces. ITiose procedures are much easier to show with actual 
manipulation than with the awkwardness of verbal description. Item 
anal 3 rses of various kinds can be quite easily handled with these edge- 
marking methods. 

The edge-marking cards can be marked directly from the original 
data. When the data are arranged in columns, it is convenient to place 
the edge of the card next to a column of data and to transfer the column 
of records directly to the adjacent edge of the card. In order for this 
procedure to be feasible, the spacing of the columns on the original 
data sheets should be the same as that of the edge-marks on the card. 
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One of the writer’s students, Mr. Albert Hunsicker, made a practical 
application by using blank cards instead of the printed cards. He 
adopted a spacing for the edge-marking that matched the spacing on 
his data sheets. The writer has used blank cards in the same manner. 
The card shown in Fig. 1 was arranged to fit double spaced typewriting. 
If the original data sheets have such spacing, then the data can be trans^ 
ferred to the edge-marking card directly by placing the edge of the 
printed card next to the column of data. This method of making the 
edge marks saves much time and avoids clerical errors. 

In making the edge marks it is advisable to avoid a soft pencil be¬ 
cause such pencil marks are likely to rub off to adjacent cards when 
they are handled. We have used red and black India ink to advantage 
on cards which are to be handled for a large number of tabulations. A 
rather fine pen point seems to be adequate for edge-marking. If the 
cards were to be manufactured in large quantities, they might be made 
with a border that is sli^tly thinner than the rest of the card. The edge 
marks would be made on this slightly thinner border and they would be 
less likely to rub off onto adjacent cards even with extensive handling. 
The difficulty is avoided by using a medium-hard pencil or by using ink. 

The flexibility and economy of the edge-marking method of classify¬ 
ing and analyzing data justify more general use than it has had in the 
past. The method is especially useful for those problems in which the 
investigator works either alone or with some clerical assistance and 
where tabulating equipment for pimched card methods is not readily 
available. For some problems with relatively small populations and 
with many attributes for each individual subject, the writer prefers 
the edge-marking method to the punched card procedures because the 
significant relations can be identified by inspection without actual 
counting and tabulation. The several examples of this paper are in¬ 
tended to illustrate the versatility of edge-marking which can be 
adapted to a great variety of purposes. 
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PART n 

CHARACTERISTICS OP THE WPB REPORTING STRUCTURE 

An understanding of the scope of the WPB reporting struc¬ 
ture demands more than a knowledge of its development as 
described in the first article of this series. Further insight into 
its intricacy and breadth is given by an analysis of its quanti¬ 
tative and qualitative characteristics. 

NUMBER OP WPB QUESTIONNAIRES 

N umerical statements concerning WPB questionnaires must be 
viewed with caution. The numbers of requests for data, and the 
figures purporting to reveal their character, have serious limitations as 
units of measure. Reporting forms varied in length and complexity 
from a multipage form, such as PD-25A, to a simple registration state¬ 
ment requesting only a name and an address. Some complicated forms 
were often little used by management and simple questionnaires fre¬ 
quently had a high administrative value. Such factors ^ould be kept 
in mind in appraising the following analysis. 

From March 1941 to September 1946, a total of 4,083 different forms 
wore approved for issuance by the WPB. This figure includes a total of 
2,397 repetitive or recurring forms, and 1,686 “one-time” forms. These 
figures do not take into account the “wild-cat” forms submitted to in¬ 
dustry without formal approval, those forms continued by other gov¬ 
ernmental agencies and trade associations for use by the WPB, forms 
submitted for formal approval but disapproved (the disapproval rate 
was about 10 per cent of the forms submitted), or the unknown but 
substantial number of forms that were discussed and then withdrawn 
by the originating unit before formal submission. The total figures 
compare with 309 questionnaires issued by the War Industries Board in 
World War I, 
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QTTANTITATITB ANALYSIS OF BBPETITIVE FOBMS IN USB 
IN DECEMBEB 1943 

A quantitative analysis of the recurring questionnaires in existence 
at one point of time gives an enlightening cross-section view of the 
WPB reporting structure. The study reveals the character of data used 
by the WPB management since continuous surveys were, for the most 
part, the instruments upon which administrative action was founded. 
The techniques used on forms at one point of time, however, should 
not be considered representative of the entire war period. Certainly 
the discussion in the previous article indicated the hazards in drawing 
such a conclusion. 

The following study of questionnaires in use in December 1943 was 
derived from an Analysis of the Use of WPB Forms prepared by the 
General Statistics Staff of the WPB and published in December 1943.^ 
During December 1943 there were in use a total of 707 recurring forms. 
It is useful to compare this total to counts of repetitive reports in use at 
various points of time. The numbers were: September 1942, 609; 
January 1943, 586; May 1944, 903; and August 1946, 442. 

In the following study a number of one-time and irregular reports 
are added to the 707 repetitive questionnaires in use in December 1943 
to make a total of 786 forms analyzed. The 786 forms are classified into 
the following uses: statistical, 508; application, 305; scheduling, 48; 
and other, 95. Included in the “other” category are registrations, cer¬ 
tificates, bills of material, reports of surplus stock, and appeals. Since 
one form often had several uses the total number of these 786 forms 
amounted to 956. 

WPB questionnaires and reporting instruments were predominantly 
of three types: statistical, application, and scheduling. Statistical ques¬ 
tionnaires were those designed to obtain information for use in policy¬ 
making and in serving as guides to action. They wore not vehicles for 
administrative action. Applications were primarily designed to serve as 
instruments for authorization by the WPB to industry to take certain 
action. Most of these applications required the submission of data 
that could be used for statistical purposes. The WPB learned that 
reporting was facilitated on a quid pro quo basis, that is, when the sub¬ 
mission of data resulted in authority to industry to do something. 
Scheduling forms were those designed primarily to obtain information 
on the dates at which a manufacturer proposed to ship a designated 


»See also Journtd of The American Statistical Assoeiaium, Vol. S9, No. 226, 1944, pp. 144-154; 
^Wartime Faeta and Peacetime Needs,* by Veridl D, Heed. 
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item to a specifically identified customer. On the basis of such reports 
the WPB “froze” or modified schedules. Many questioimaires embodied 
all three of these characteristics. 

The type of data requested on statistical forms is indicative of the 
sort of information which the WPB felt it needed for operations. The 
following summary derived from an analysis of the 508 forms used for 
statistical purposes, shows the number of forms requesting the type of 
information specified: 


Capacity 

Production: 


70 

Actual 


204 

Scheduled 


111 

Shipments 


271 

Consumption 


181 

Receipts or accepted deliveries 

178 

Inventory 

Orders: 


339 

Unfilled 


147 

New 


119 

Cancelled 


52 

Requirements 


102 

End Use Pattern 


128 

Other 


151 

The frequency of return of all 786 forms and the 508 statistical forms 

is summarized as follows: 



Frequency of 

Return 

All Forma 

Statistical 

Forma 

One-time 

79 

42 

Daily 

1 

0 

Weekly 

3 

2 

Semi-monthly 

8 

6 

Monthly 

404 

361 

Bi-monthly 

3 

0 

Quarterly 

88 

71 

Semi-annua>l 

1 

0 

Annual 

1 

1 

Irresular 

198 

46 


The proportion of the total maximum number of respondents filing 
each form was important for statistical reports. The extent of coverage 
of the 508 statistical forms was as follows: complete coverage, 383; large 
firms only, 55; sample of large, medium, and small firms, 22; and cov¬ 
erage uni^own, 48. 
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Analysis of the t3rpe of respondents shows some of the broad areas 
covered by the WPB's reporting. Of the 786 forms, 430 were filed by 
manufacturers; 88 by other types of processors, such as converters, 
tanners, refiners, foundries, and mills; and 159 by “consumers” who 
were largely, but not entirely, manufacturers filing forms as consumers 
of raw materials. At the distribution level there were 121 forms filed 
by other classes of respondents such as warehouses, distributors, whole¬ 
salers, importers and retailers. There were 6 surveys the respondents of 
which were utilities; 15 covered government agencies, and 85 covered 
other groups. (The total of groups is 904 and results from the fact that 
one form often covered several groups.) 

The number of respondents filing WPB questionnaires varied from 
one to hundreds of thousands. Form PD-789 (producers monthly re¬ 
port of fibrous glass production), for example, was filed by one com¬ 
pany. Preference-rating certificates, such as PD-lA, and various other 
application forms were filed by thousands of individuals, plants, and 
companies. Contrary to popular impressions the average number of 
respondents to most reporting forms was relatively small. The follow¬ 
ing summary table shows that the number of respondents was fewer 
than 100 for more than half of all statistical forms. An astonishing num¬ 
ber of forms was collected from 25 or fewer respondents. Only 54, or a 
little more than 1 per cent of the total, were submitted by more than 
1,000 respondents. 

TABLE I 


Number of Respondents Classified by 
Reportinic Periods 


Number of 
Respondents 

One 

Tune 

Monthly 

Quarterly 

Other 

Total 

N.A. 

6 

10 

4 

8 

27 

10 and less 

1 

31 

4 

7 

43 

11-26 

4 

46 

6 

8 

63 

20-60 

2 

63 

6 

8 

69 

61-100 

4 

60 

13 

6 

83 

101-200 

3 

48 

10 

5 

66 

201-600 

6 

48 

10 

3 


601-6000 

14 

39 

15 

9 


over 5000 

3 

8 

2 

0 

mgm 

TOTAL 

42 

343 

69 

64 

608 


Most of the questiozmaires were related to a WPB order, such as an L 
or M order. Of the 786 forms, 178 were not directly related to orders; 
the remainder, 608 forms, were related to more than 300 different oi>- 
ders. 




















467 


WFB STATISnCAli BEPOBTtNG, *a 

WFB BEFOBT8 FIIjED BT A UBBITTM-SIZED w.ak tp 

II is informative to study the quantitative characteristics of the 
WPB reporting system by analyzing the reports filed by a single indus¬ 
trial plant,* For this purpose a medium-sized plant which employed 
approximately 6,500 persons and produced electric motors, various 
types of electrical equipment and miscellaneous war products was 
selected for study. Keporting requirements for various products were 
so diverse, however, that no one plant should be considered represen¬ 
tative of all plants. 

A total of 67 reporting forms were chosen. Excluded from the study 
are comparatively insignificant and irregular reports, telephonic re¬ 
quests for data, informal letters requesting information, various au- 
thoriring forms which were not submitted directly by the plant, and, of 
course, reports filed with the Aircraft Scheduling Unit, Board of 
Economic Warfare, War Department, Navy Department, and other 
such procurement agencies, whether the form did or did not carry a 
WPB form number. Inclusion of these items would, of course, result in 
a much greater total number of questionnaires and surveys actually 
submitted by this plant. 

Table II shows the frequency throughout the war period with which 
selected categories of information were requested on the 67 forms. In 
December 1941 this plant was required to submit only 4 forms. These 
were statistical-type reports filed monthly; three covered aluminum 
data and one asked for information about copper. By the first half of 
1943 the number had risen to 37 and thereafter fell off only sl^tly. 
Throughout the war period inventory data were most frequently re¬ 
quested while data pertaining to shipments and production were next in 
importance. 

Neither these figures nor others in the tables reveal fully the volume 
and detail of infornmtion asked on individual forms. Modifications of 
old questionnaires and new schedules tended to request more penetrat¬ 
ing information, in greater volume and detail, as production and dis¬ 
tribution problems became intensified in the period from early 1941 
through 1943. Thereafter there was a general tendency in the other 
direction, but it was not strong imtil the spring of 1945. 

Table III classifies the type of information reported by the frequency 
of the reporting period. This table demonstrates clearly that when 
considering the war period as a whole most information, particularly 
detailed information, was reported on a monthly baris. 

* The authors wish to aoknowledee that Mrs. SUisabeth Joyce undertoofcf under the saperv&ion 
of the authors, the analysiB upon whi<di this seoiaon is based* 
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TABLE II 


Selected Categories of Information Requested on Fifty-Seven WFB Forms 
Filed by One Plant in Seloeted Periods of Time 


Infonnationid 

Category 

Total 

Forms 

Dupl. 

1041 

Periods of Time 

1042 

1943 

1044 

1946 

Jan. July 
June Dec. 

Jan. July 
June Doc. 

Jan. July 
Juno Deo. 

Jan. July 
June Deo. 

Capacity 

7 


n 

4 2 

2 1 

1 

Production 



IB 




Actual 

16 

3 


11 10 

9 10 

10 

Si^eduled 

10 


iB 

7 8 

8 10 

10 

Shipments 



HI 




Dollar 

8 

1 


5 2 

3 3 

3 

Unit 

23 

4 

7 10 

17 14 

12 12 

12 

Material Consumption 







Total 

13 

1 

4 7 

11 9 

6 6 

6 

By products 

4 

1 

3 3 

3 2 

2 2 

2 

Anticipated 

7 

1 

1 2 

7 4 

3 3 

3 

Receipts or Accepted 







Deliveries 

15 

1 

5 8 

13 10 

6 6 

6 

Inventory 

26 

3 

10 12 

21 19 

15 16 

16 

Orders 







Unfilled 



1 2 

6 6 


7 

New 



2 

6 4 


3 

Canceled 


■ 

1 

3 3 


2 

Material Requirements 

14 

■ 

5 7 

12 8 


7 

Labor 


■ 



■B 


Use 

5 


3 2 

3 3 


2 

Requirements 

1 

H 


1 1 

Ib 

1 

End Use Pattern 

13 

H 

1 6 

8 8 

6 6 

5 

Number of Unduplicated 


B 





Forms 

67 

B 

16 20 

37 36 

32 31 

30 


The frequency with which various typos of iufonnation were reported 
over the war period on t 3 rpes of questionnaires is shown in Table IV. 
This table shows that most information was reported on statistical-type 
forms with ^pment, production, inventory and material consumption 
data predominating. 

Table V on page 471 shows that most of the reports were very limited 
in scope. Throughout the war period the plant studied was requested 
to file the great bulk of its reports for individual products. 





























WPB STATISTICAL EEPORTING, H 


469 


TABLE III 


Selected Categories of Information Requested on Fifty-Seven WPB Forms 
Filed by One Plant by Time Frequency of Reporting 


Informational 

Total Forms 

Frequency 

Category 

Duplicated 

Monthly 

Quarterly 

One-Time 

Irregular 

Capacity 

7 


— 

2 

1 

Production 

22 


6 

1 

2 

Actual 

16 


2 

1 

1 — 

Scheduled 

10 


4 


2 

Shipments 

27 

16 

8 

3 


Dollar 

8 

1 

6 

1 

— 

Unit 

23 

16 

4 

3 

— 

Materials 






Consumption 

18 

10 

5 

1 

2 

Total 

13 

9 

1 

1 

2 

Byproducts 

4 

1 

3 

— 

— 

Anticipated 

7 

4 

2 

1 

— 

Receipts or Accepted 






Deliveries 

15 

11 

2 

1 

1 

Inventory 

26 

14 

7 

4 

1 

Orders 

15 

10 

4 

1 

— 

Unfilled 

12 

7 

4 

1 

— 

New 

6 

6 

— 

— 

— 

Canceled 

3 

3 

— 

— 

— 

Material Requirements 

14 

7 

4 

— 

3 

Labor 

2 

_ 


— 

2 

Use 

5 

1 

1 

1 

2 

Requirements 

1 

— 

— 

— 

1 

End Use Pattern 

13 

8 

1 

1 

3 

Number of Unduplicated, 






Forms 

57 

25 

10 

7 

15 


Most of the forms for wWch the company was hold responsible were 
monthly summaries. The frequency with which reports were filed is 
sununarized in Table VI on page 471. 

Conclusions should be drawn from these quantitative data only after 
examination of other aspects of the reporting problem. These studies 
become more meaningful when considered in the light of outstanding 
qualitative characteristics of the WPB reporting ssrstem. 
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TABLE nr 


Solcotod Categories of Information 3Ele4ucRtod on Fifty-Seven 'WPB Forms 
Mlod by One Plant by Character of Use 


Informational 

Category 

Total Forma 
Duplicated 

Statistical 

Statistical 

and 

Application 

Application 

Other 

Capacity 

7 

mm 

0 

0 

2 

Production 

22 


2 

6 

0 

Actual 

16 


2 


0 

Scheduled 

10 

mM 


6 

0 

Shipments 

28 



2 

1 

Dollar 

8 



2 


Unit 

23 

18 


0 

1 

Material Consumption 

18 

8 

8 

2 

0 

Totals 

13 

7 

6 

0 

0 

By products 

4 

2 

2 

0 


Anticipated 

7 

3 

2 

2 

0 

Reedpts or Accepted 






Deliveries 

15 

8 


0 

0 

Inventory 

26 

15 


1 

3 

Orders 

13 

8 


3 

1 

UnfiUed 

12 

8 


3 

1 

New 

6 

6 


0 

0 

Canceled 

3 

3 


0 

0 

Material Reqoiiemonts 

14 

2 

9 

3 

0 

Labor 

5 

3 

0 


1 

Use 

4 

2 



1 

Requirements 

1 

0 

0 


0 

End Use Pattern 

13 

4 

3 

pBBI 

2 

Number of Unduplicated 






Forms 

67 

27 

9 

1 « 

8 


GENEEAL QTTALITATIVB CHARACTERISTICS OP THE WPB REPORTING 

STRtJCTtIRB 

The WPB reportuig framework developed varioiis broad qualitative 
patterns which gave it a distinct character. The reporting structure in 
its formative years, and to a certain extent in its entire lifetime, was 
not flexible enough to meet new fundamental problems as they ap¬ 
peared. It did not have su£5.cient substance to permit the detection of 
crises before they arose. Except in connection with isolated product 
control problems the reporting structure followed rather than preceded 
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TABLE V 


Distribution by Time Periods of Company, Plant, and Product Reports 
Piled by One Plant on Hfty^Seven Forms 


Time Period 

Company 

Report 

Plant 

Report 

Product 

Report 

Total (unduplicated) 

3 

2 

62 

1941 

0 

0 

4 

1942 




First Half 

2 

1 

12 

Second Half 

0 

2 

18 

1943 




First Half 

1 

2 

34 

Second Half 

1 

2 

33 

1944 




FiratHalf 

1 

2 

29 

Second Half 

1 

1 

29 

1945 




FiistHalf 

1 

1 

29 


the changing character of needed controls. The reporting structure, 
therefore, was molded largely by improvisations generated out of the 
pressures of events. Its ability to prepare for and, by so doing, to choke 
off serious production problems was not one of its sources of strength. 

TABU! VI 

Distribution by Time Periods of Irregular, One-Time, Monthly, and Quarterly 
Reports Filed by One Plant on Fifty-Seven WPB Forms 


Time Period 

Irregular 

ja 

One-Time 

Monthly 

Quarterly 

Total 

15 

7 

25 

10 

1941 

0 

0 

3 

1 

! 

4 

1942 

First Half 

2 

2 

7 

Second Half 

4 

2 

11 

3 

1943 





First Half 

10 

2 

22 

3 

Second Half 

13 

2 

17 

4 

1944 





First Half 

14 

0 

12 

6 

Second Half 

14 

0 

12 

6 

1945 





First EUlf 

13 

0 

12 

5 
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The reporting ^srstem penetrated almost every conceivable aspect of 
industrial production activity, from the accumulation of total produc¬ 
tion data to cumbering incidentals. Moreover, WPB surveys did not 
stop with industry: they also touched farmers, housewives, retailers, 
wholesalers, financial institutions, and foreign Governments. The range 
of information sought was as expansive as the breadth of controls re¬ 
quired to channel the productive efforts of the industrial machine. 
Data requirements ranged from questions on operations of giant cor¬ 
porations to the needs of an individual farmer who wanted to buy an 
alloy steel bolt to fibc his tractor. There were actually millions of appli¬ 
cations transmitted to the WPB by individuals and small businessmen 
to purchase items on restricted lists. 

Chronologically, the main evolution of the reporting structure per¬ 
tained to individual products and materials. Not only did this pattern 
fit the general philosophy of control but it fitted also the organizational 
structure of control agencies. Throughout the war period the basic 
fabric of the reporting structure was patterned after individual product 
reporting. The superimporition of comprehensive surveys upon those 
product reports created many intricate and serious technical problems. 
Except in a few areas, coordination of comprehensive surveys and indi¬ 
vidual product reports was never fully accomplished. 

The most commonly understood characteristics of the reporting 
structure probably were the overlapping of questioimaires, incon¬ 
sistency in reporting requirements, and duplication of incoming in¬ 
formation. In the chaotic period in mid-1942, major overlapping and 
duplication was inherent in such broad and baric reports as PD-275, 
PD-25A, WPB-732, and in individual allocation schedules. Within 
each of the individual allocation systems the same type of duplication 
existed. For molybdenum, for example, a steel producer might report 
molybdenum requirements on his alloy melt schedule, PD-391; his 
consumption and stocks of molybdenum on WPB-IOG; detailed molyb¬ 
denum requirements for melting on PD-258; molybdenum consumption 
on PD-259; and present his application and obtain his allocation for 
molybdenum, pursuant to order M-110, on PD-260. Many comparable 
rituations arose from administrative expediency which did not admit 
sufficient time to eliminate such conditions through coordination. 

Throughout its existence, although in lessening degree, the reporting 
structure revealed an amaring lack of procedural unification and 
standardization. This extended from questionnaire format through 
coverage, tabulation, and informational usage. The WPB, for example, 
constantly struggled with the problem of standardiiang application 
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forms relating to L and M orders. In mid-1942 the Chemicals Division 
devised two standard forms for its orders (PD-600 for consumers of 
chemicals and PD-GOl for producers of chemicals) which eventually 
were used to implement 117 orders. This type of experience, however, 
was not common in the WPB and most orders carried their own spe¬ 
cialized forms. 

Technical methodology developed in the reporting structure gen¬ 
erally evolved through three stages. The first stage was characterized 
by difl&culties in grasping problems which reports could solve, in recog¬ 
nizing the applicability of (Jld reporting procedures, or in developing 
new procedures to solve them. In this phase, fumbling, experimentation 
and false starts predominated. Out of these experiences a second stage 
emerged in which practical methods were sorted from impractical ones 
and incorporated in recognized reporting standards. The third stage 
was characterized by an application of these methods to the problems 
at hand and the development of modifications by which they were 
made to fit better the needs of management. 

Although most methods developed along these lines it must not be 
inferred that there was a free interchange and free use of methods 
among the WPB’s personnel so that tested techniques used by others 
were readily adopted in aU areas where they were applicable. WPB 
personnel groups showed a strong resistance to accepting methods and 
procedures adopted by others. Each new problem was considered to be 
highly specialized and to demand highly specialized methods for its 
solution. Although on the surface new problems often appeared to be 
different from others which had arisen, most problems displayed com¬ 
parable fundamental characteristics. Their solution, in so far as that 
could be accomplished by statistics, thus merited review of techniques 
adopted in other areas. But such review was never sufficiently general 
to result in free interchange and mutual acceptance of common experi¬ 
ence. 

The WPB reporting S 3 rstem was considered a “step-child” by far too 
many administrators. Its behavior, except by those fostering it, was 
often viewed with suspicion. It was characteristic of WPB personnel 
to disparage the results of all surveys except their own, particularly 
when the data from each bore on a common problem. This attitude was 
probably more indicative of ignorance regarding statistical methods and 
procedures than of unjustified affection for one’s own data accumula¬ 
tion. Each person sponsoring and using information generally was 
aware of its strengths and weaknesses. But the complexity surrounding 
the collection and use of each set of data was such that generally 
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nothing short of close familiarity with all details of the data could pro* 
vide full comprehension of them. 

The reporting structure was subject to rapid changes not only in 
complete basic control systems but also form details. We have already 
seen the outlines of swift changes in broad control mechanisms. Even 
quicker changes took place vnth particular forms. Many of the one¬ 
time questionnaires appeared suddenly and without previous notice. 
Among the periodic or recurring reports rapid and unannounced data 
changes often took place. These changes were unusually swift and sub¬ 
stantial early in the war period but even toward its end sudden and 
radical alterations of old forms were frequent. In 1944 central review 
action was taken on a total of 2,841 questionnaires. Of this number 
1,933 were revMons or extendons of existing reports. Some revidons 
were of no consequence; others were so extendve as to constitute a 
completely new questionnaire. 

The reporting structure, as an instrument or tool of industrial 
management control, reflected either directly or indirectly all the in¬ 
tricate wartime production control problems. It was the impact of 
these problems on the use of reporting for management purposes, and 
the resolution, irresolution, and compromise exercised in solving them, 
that gave the reporting structure its character and direction. An ap¬ 
praisal, therefore, of the WPB reporting structure in terms of statistical 
reporting ideals cannot be realistic without a recognition of these in¬ 
fluences. 


PART m 

CONTBOLLINa THE ISSUANCE OF QUESTIONNAIBES 

Fortunately, in the earliest days of the NDAC, a central clearance 
function was established to approve or disapprove issuance of ques¬ 
tionnaires. In retrospect, the need was obvious for a centralized review 
of proposed data collections to avoid duplication, to coordinate report¬ 
ing, to maintain technical standards, and to insure administrative 
efficimicy in the use of collected data. Through central control, all 
problems relating to questionnaires were brought to a focus. 

Before a questionnaire could be approved for issuance, it was neces¬ 
sary to compromise a variety of pressures relating to its issuance. The 
range of WPB data requirements and methods was so broad and com¬ 
plex, the impact upon industry so difficult to measure, and the rela¬ 
tionship between administrative responsibility and statistical proce¬ 
dural standards so intricate, that questionnaire Issuance created thorny 
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problems which by their very nature could not be answered to the 
satisfaction of everyone concerned. On the one hand, for example, were 
the desperate appeals for clearance of requests “without which the 
WPB cannot operate.” On the other hand were complaints from 
industry that “we are producing weapons not statistics.” Compromising 
such conflicts was difficult. 

LEGAL BASIS OF CENTBAL OONTBOL OVEB FOBM ISSUANCE 

The wartime scramble for facts by Government agencies led to a 
substantial volume of Government questionnaires to industry. Wide¬ 
spread criticism of poorly devised report forms, duplication of data 
requests, unintelligible instructions, and the rapidity with which re¬ 
quests multiplied, reached a peak early in 1942. Action was not slow in 
coming. Corrective measures were forced on various Federal agencies, 
committees formed to study the problem, and various business ad¬ 
visory groups were pressed into service. The most visible result of these 
actions was the passage in December 1942 of the Federal Reports Act 
of 1942. 

The Federal Reports Act formalized responsibilities of Federal agen¬ 
cies in their data collection work and provided a basis for the Bureau 
of the Budget to control all data-collecting activities of Government. 
The act specifically directed the Bureau of the Budget: “to investigate 
the needs of the various agencies for information from business enteiv 
prises and other agencies; to investigate the methods used in obtaining 
such ioformation; and to coordinate information-collecting services in 
order to reduce the cost of the Government and minimize the burden 
upon business and industry and utilize, so far as practicable, files of 
available infomuition and existing facilities of the established Federal 
departments and independent agencies.” 

The NDAC in largo measure had anticipated the Federal Reports 
Act of 1942 and by so doing kid prevented a far more chaotic situation 
than that which existed in the early war period. At the first meeting 
of the Advisory Commission to the Council of National Defense, held 
on June 12, 1940, it was stated that “experience had already demon¬ 
strated the necessity for establishing a central office through which all re¬ 
quests of the Commission for information would be cleared, that a plan 
was being prepared, and that a tentative su^estion as to the oiganiza- 
tion of such an office would be presented to the Commission not later 
‘than Saturday, June 15, 1940.” In the summer of 1940, before the 
NDAC began to collect data from its own questionnaires, the Division 
of Statistics of NDAC, jointly with the Bureau of the Budget, studied 
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the reporting experience of the War Industries Board in World War I. 
Analysis of problems faced by the War Industries Board (which did 
not have formal centralized form control) led to the recommendation 
that the NDAC should create a central clearance point for data requests 
originating within the Commission. A control office was created but it 
operated informally and had no power to compel compliance with its 
decisions. 

Besults achieved by this office in coordinating data requests and in 
improving the character of questionnaires issued, however, led to more 
formal authority. This was established, after the 0PM absorbed 
NDAC, by the issuance of 0PM Administrative Order No. 4. 

Administrative Order No. 4 required every office of the 0PM to 
obtain the approval of the Division of Statistics before any formal in¬ 
quiry could be sent to industry. WPB General Administrative Order 
No. 6, issued in February 1942, carried this clearance requirement 
through the new WPB. Criteria set forth in the order to govern the 
review of questionnaires are worthy of note. They were; 

“1. That the information to be requested is needed at the time it 
is to be ffied and that the need justifies the effort and expense 
on the part of both industry and government in order to 
obtain it. 

2. That the data are not already being secured by any other 
federal agency, or, if being secured, are not and cannot be 
made available to the War Production Board. 

3. That the request includes clear and si>ecific definitions of the 
required data. 

4. That the required report is adapted as closely as possible to 
the types of records ordinarily maintained by business con¬ 
cerns or available to them. 

5. That staff and equipment are available to tabulate or other¬ 
wise process the returns when received.” 

The Office of Survey Standards was established in the Division of 
Statistics to carry out the provisions of this Order. Passage of the Fed¬ 
eral Reports Act of 1942 necessitated only one revision m WPB pro¬ 
cedure : each report form in use was given an approved number assigned 
by the Bureau of the Budget. 

There was continued in the 0PM, from the NDAC, an Office of 
Procedures which also was responsible for the review of questionnaires. 
The Office of Procedures carried broad responsibility for reviewing all 
administrative procedures relating to all plans, programs, orders, and 
r^ulations proposed for issuance [by the 0PM. This Office carried re- 
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sponsibilities in administrative procedures comparable with those of 
the Office of Survey Standards in questionnaire methods and stand¬ 
ards. As part of its operations the Office of Procedures reviewed ques¬ 
tionnaires in terms of administrative practicability, relationships with 
control programs, and other procedural matters. In exercising this func¬ 
tion there was a clear demarcation from functions of the Office of 
Survey Standards. The functions of both, however, shaded into each 
other. 

This overlapping proved to be a source of strength rather than 
weakness. Nevertheless demarcation was required. In April 1943 a 
working agreement was reached to formalize previous informal arrange¬ 
ments. It was agreed that the Office of Survey Standards would confine 
its review of data requests to “statistical problems, such as: (1) review 
of the kind of data to be secured; (2) the form in which such data was 
to be secured, e.g., pounds, feet, tons, gallons; and (3) the instructions 
for filling in the requested data.” On the other hand, the Office of Pro¬ 
cedures confined its review to “administrative impact,” that is: (1) 
the administrative necessity for the form, e.g., if the method of control 
provided in the order was questioned, the form implementing such 
control ipso Jacto was questionable; (2) the possibility of consolidation 
of forms; and (3) instructions covering the number of copies of forms 
which were to be filed, with whom filed, and so on. Broadly, the former 
matter was statistical and the latter procedural. 

This agreement crystallized a procedure which had existed long be¬ 
fore. Data requests followed a simple path. First, the Office of Survey 
Standards reviewed new questionnaires. If the questionnaires wore 
approved they moved to the Office of Procedures for review against 
existing and efficient administrative procedures. Since our interest is 
primarily statistical rather than procedural the remainder, although 
not all, of this section concerns operations of the Office of Survey 
Standards. The concept of “centralized questionnaire control” docs, 
however, include operations of both groups. 

CLEARANCE PROCEDURE 

Rudimentary principles of efficient action for clearance of data re¬ 
quests were so well recognized by those familiar with the problems in¬ 
volved that no serious question of clearance procedure arose. Varying 
circumstances surrounding the adoption of each new reporting pro¬ 
posal, however, often resulted in short-cuts to established lines of ac¬ 
tion. Nonetheless, most inquiries went through much the following type 
of critical review. 

The bulk of data requests (over 90 per cent) originated in the WPB 
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Industry and Material Divisions. The divisions were staffed for the 
most part with businessmen fresh from industry, who were presumed 
to have some knowledge of the data needed for control and of the 
ability of their industiy to supply the requested information. These 
businessmen had help from professional statisticans assigned to each 
division, who were presumed to have technical skill in collecting and 
using data. The idea that this first screening would result in a mini¬ 
mum of ill-conceived questionnaires, however, was not supported by 
the facts. 

After these preliminaiy reviews the data request was submitted to 
the Office of Survey Standards. The transmittal folder contained a 
transmittal letter; a memorandum of justification; copies of the pro¬ 
posed questionnaire; and other necessary papers such as processing 
directions, printing requisitions, mailing instructions, and so forth. The 
Office of Survey Standards was required to approve or disapprove the 
data request within three days of submission, a time schedule not al¬ 
ways met, or to notify the initiating unit of objections. Copies of the 
form, memorandum of justification, and other papers were routed to 
the Bureau of the Budget during this three-day period to notify the 
Bureau of the proposal and to allow concurrent review by the two 
Offices. If it became necessary to consult the unit originating the report, 
a representative of the Bureau of the Budget was invited to be present 
to prevent unnecessary repetition of interviews. 

When a data request was approved by the Office of Survey Stand¬ 
ards, a formal request for approval was made to the Bureau of the 
Budget under the provisions of the Federal Reports Act of 1942. Be¬ 
cause of the practice of concurrent review, there were probably not 
more than a half dozen cases in which the Bureau of the Budget dis¬ 
approved a form after formal request for approval by the Office of 
Survey Standards. Decisions to modify or disapprove data requests 
were ^cussed with the originating unit to roach agreement on issues 
involved. In most instances this practice led to compromises acceptable 
to the parties concerned. But if no compromise agreement could be 
reached, and where the originating unit could not be convinced of the 
need for withdrawal of the request, no alternative was left but to set 
forth formally the reasons for rejection. 

Havit^ obtained approval by the Bureau of the Budget for a ques- 
tioimaire, the Office of Survey Standards affixed the Budget Bureau 
approval number, signed the print requisition, and forwarded the final 
draft of the form and the printing and mailing requidtions to the Office 
of Procedures. That Office, which had charge of coordinating pro- 
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cedures used by various divisions, was consulted in advance by the 
Office of Survey Standards if a proposed data request implied a chaise 
in established procedure. Consequently, obtaining written approval of 
the Office of Procedure was a fairly simple formality. The Office of 
Procedures, after review, routed the documents to the Printing and 
Distribution Control Division which had jurisdiction over the layout, 
printing and bulk distribution of all WPB forms. 

This general review procedure altered little during the war period. 
This was partly due to legal restrictions, partly to a general recognition 
of the soundness of the procedures, and partly to the fact that the 
procedures were flexible enough to operate effectively in a rapidly shift¬ 
ing reporting structure. From time to time, however, this procedxure 
was supplemented by the work of special committees. 

COMMITTEE FOB THE REVIEW OF DATA REQUESTS FROM im>USTRT 

At no time during the war period was there an absence of strong 
industry criticism of WPB forms. WPB data requests constituted a 
burden on industrial operations. One of the first efforts to lighten the 
burden was begun in Jimo 1942 by a Committee for the Review of 
Data Requests from Industry. 

This committee, composed of representatives from business, the 
Army, the Navy, and the Bureau of the Budget had wide powers to 
correct shortcomings of WPB questionnaires. The establishment of 
the Committee, and its system of sub-committees and industry ad¬ 
visors, did not conflict with past clearance procedure. Rather, it 
created a “Court of Appeals” which constituted a new clearance hurdle. 

The Committee had the support of all responsible executives in the 
WPB and all others intorosted in improving the elaborate and extensive 
questionnaire structure created by the WPB. Each WPB division re¬ 
sponsible for dats) requests was asked by the Committee to review its 
questionnaires and to supply the Committee with a statement of repet¬ 
itive report forms in use showing: those which should be continued, 
those which should be modified, and those which could be discontinued. 
An elaborate system of subcommittees was created to review various 
tjTpes of quostiozmairos. Industry representatives were invited to visit 
Washington and, in collaboration with the Committee, to make a re¬ 
view of each repetitive reporting form. Further, the Committee solic¬ 
ited criticism and suggestions from over 1,000 trade associations and 
their members in a drive to ferret out unnecessary duplication, to define 
areas where consolidation of reports could be effected, and to restudy 
the reports which caused difficult filing problems. In response to this 



480 AMEBICAN STATISTICAL ASSOCIATION JOXTBNAL SEFFEMBEB 1B48 

request, a total of approximately 40,000 letters were received in mid- 
1942 embodying the important complaints, criticisms, and suggestions 
of industry. 

The vast energy expended and the calibre of talent used produced 
disappointing results. These were made the poorer by exaggerated 
claims of success by the Conunittee. The Committee itself was responsi¬ 
ble for relatively few revisions or discontinuances. Most revisions and 
discontinuances claimed had been planned by the WPB divisions be¬ 
fore the Committee was established. Pressures generated by industry 
criticism and the existence of the Committee no doubt hurried the 
process but the fact remains that relatively few revisions or discon¬ 
tinuances resulted from the literally blinding light focused on each 
inquiry. These facts support the conclusion that the complex and bur¬ 
densome reporting system was the result not of poor questionnaire 
control, inadequate policing, or the absence of self discipline, but rather 
of confused basic WPB policies. And despite the vast reporting struc¬ 
ture, there is reason to believe that the WPB erred on the side of too 
little information rather than on too much. 

The quality of industry’s critical response was also disappointing. 
Analysis of the thousands of letters from industry demonstrated that 
the burden of the WPB requests on industry, and industry objection to 
reporting forms, centered almost exclusively on the basic material re¬ 
port forms used under the individual allocation systems. The principal 
difficulty of industry in filing these forms resulted from the request for 
end use information which industry neither had nor could readily get. 
Of the hundreds of forms in use, only a handful outfflde of the raw 
material control area were the subject of industry comment. The 
trouble lay not in the report forms themselves but in the material 
control systems which they implemented. Most of the report forms in 
use were needed to facilitate operations under the various overlapping 
and inconsistent control systems in force. Before order could be brought 
into the reportii^ system there had to be a definite policy established 
for control of material flows. The coordinated efforts of industry in 
criticizing the WPB questionnaires added practically nothing to what 
was already known to statistical technicians and the top staff of the 
WPB. 

Even though the Committee was less successful than its claims, it 
performed a valuable function. It focused the attention of burinessmen 
both in the WPB and in industry on the problems of wartime informa¬ 
tional requirements, especially the character of the information needed 
by government and the ability of industry to supply the data. As a 
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result a decision was made to establish on a pennanent basis the ma¬ 
chinery through which the viewpoints of industry could be represented 
in the issuance and continuance of the WPB’s reporting forms. 

In October 1942 the permanent position of Chairman of the Commit¬ 
tee on Data Requests from Industry was established and filled by a 
business man. This function continued well into 1944 when, as a result 
of a sharp decline in the introduction of new data requests, the position 
was abolished. 

ADVISOBT COMMITTEE ON QOVEBNMENT QUESTIONNAIBES 

In addition, by a letter dated October 9, 1942, the Director of the 
Bureau of the Budget requested the President of the Controller’s 
Institute of America to establish an Advisory Committee on Govern¬ 
ment Questionnaires composed of five members, one representing each 
of the following industry organizations: the National Association of 
Manufacturers, the American Trade Association Executives, The Con¬ 
trollers’ Institute of America, the Chamber of Commerce of the 
United States, and the American Retail Federation. The Committee 
was asked to take steps necessary “to make the advisory services 
of business available to Government agencies ... in promoting the 
improvement, development and coordination of all federal statistical 
services.” The Committee, later expanded to include representation 
from the National Association of Commercial Organization Secre¬ 
taries, and the National Industrial Council, was independent of gov¬ 
ernment and responsible only to the business community. Its functions 
were purely advisory. 

The Committee operated in large part through subcommittees. 
Some groups restricted their work exclusively to a single industry, as 
for example quest iormaire problems of the arc welding industry. In 
other cases, subcommittees represented production segments of an 
industry, such as the organic chemical industry, or a functional part of 
an industry, os for example problems of accounting in questionnaires of 
the public utility industry. In still other cases subcommittees devoted 
their attention to general problems that affected many lines of buaness, 
as for example industry’s problems in filing WPB-732. Subcommittees 
met regularly and devoted themselves to the intimate details of report¬ 
ing problems. Panels were created in emei^encles to conmder nonre¬ 
curring problems. Exclusive of such panels, 58 subcommittees were 
appointed during the first year of the Advisory Committee’s existence. 

The Advisory Committee performed a valuable service to govern¬ 
ment, a service not measurable in quantitative terms. During its first 
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year of operation some 300 specific recommendations were made to the 
Bureau of the Budget and substantially all wore accepted both by the 
Bureau and the agencies whose programs were affected. There were 
also hundreds of unrecorded instances where questionnaires wore volun¬ 
tarily revised and substantially improved by persons and agencies 
affected before their original issuance. In addition, as noted by the 
Director of the Bureau of the Budget in a letter to the Chairman of the 
Advisory Committee on Government Questionnaires, dated December 
3, 1943, its existence encouraged greater care in the development of 
governmental forms and greater attention to the abilities of respondents 
to supply the information requested. Although these considerations 
applied to all Government agencies, the bulk of the work of the Com¬ 
mittee pertained to questionnaires of the WPB since its surveys loomed 
so large in the total of all Government inquiries. 

TASK COMMITTBB ON BEDTTCnON OF SPBCUL AFFIiICATIOIfB 

In mid-1943 renewed industry complaints about the WPB’s informa¬ 
tional demands coincided with a general over-ail improvement in sup¬ 
ply-demand relationships in many basic materials and intermediary 
products. These circumstances made it difficult to visualize the need 
for continuing many reports. A Task Committee on Heduction of Spe¬ 
cial Applications was therefore established by the Operations Vice 
Chairman and the Program Vice Chairman to determine the TninimuTn 
number of special applications that should be required from industry. 
Primary responsibility for a general housecloaning was placed upon the 
WPB Division Directors. Since the Chairman of the Task Committee 
was also the Deputy Vice Chairman for Operations with full authority 
to act when a decidon was made, prompt results were expected. 

A reduction of 40 per cent in the applications required from industry 
was set as the objective to bo achieved through the following methods: 

"1. Elimination of allocation or special authorization procedures 
wherever supply warrants. 

2. Substitution of limitation orders or restrictive provisions in 
orders for those which now require special applications. 

3. Small order exemptions to be incorporated in the existing or¬ 
ders so that not less than 10 x>er cent of supply will be dis¬ 
tributed in this manner unless conclusive evidence is given 
justifsdng a lower percentage. 

4. Replacement of consumer applications by suppliers’ order 
board reports. 

5. Substitution of quarterly applications for monthly applica¬ 
tions. 
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6. Substitution of quotas for special applications (including MB.O) 
now required wherever consumption and use have been estab¬ 
lished.” 

Coincident with work on weeding out unnecessary application forms, 
the Task Committee was to attempt standardization wherever possible, 
particularly as to end-use information; to ask only for facts readily 
available to manufacturers and eliminate those questions which manu¬ 
facturers had to answer from supplier or customer records; and to re¬ 
duce wherever possible paper work other than questionnaires. 

COMMITTEE ON DEMOBILIZA.TION OF CONTBOLS AFTEB V-J DAT 

In the summer of 1944, when the collapse of Cerman resistance 
seemed to bo imminent, the pace of reconversion preparations, which 
had been in progress from as far back as the spring of 1943, accelerated 
quickly. Leading the reconversion plans was a Committee on De¬ 
mobilization of Controls after V-E Day, familarly known as codcave. 
When the Battle of the Bulge in the winter of 1944-45 shifted attention 
from reconversion to renewed war-production, a new Committee on 
Period One (CPO) was established in February 1945 to review the 
program developed by codcave and keep it up to date. This Commit¬ 
tee functioned imtil Y-J Day. When Germany collapsed it had a com¬ 
plete program for relaxation of controls ready to be put into effect at 
once. Despite the unexpectedly short period between V-E Day and 
Y-J Day, it had a similarly complete plan when the Japanese surren¬ 
dered. 

The fundamental concern of these Committees, of course, related to 
baffle production and distribution controls. But accompansdng those 
controls were reporting instruments. Plans concerning questionnaires 
were developed by a Sub-Committee on Reports and Data Requests. 
As a part of the work of the principal Committee, every control policy, 
order and regulation was reviewed by the subcommittee. 

The principal objectives which the subcommittee established in re¬ 
viewing questionnaires are worthy of note. First, of course, was the 
desire to eliminate all non-essential reports growing out of relaxation 
of basic controls and management needs. Second, it was decided to 
maintain an intelligent reportii^ program which would permit the 
WPB and other war and reconversion agencies to be properly informed 
and prepared to take any necessary action in the reconversion jreiiod. 
Third, it was considered important to preserve in an uninterrupted 
series all reports which could be regarded as having post-war value to 
penxument Govormnent agencies. In contrast with the approach to 
comprehensive elimination of restrictive controls, it was felt that in re- 
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vising its reporting roqniromonts following V-E Day, the WPB should 
err on the side of caution rather than over-hasty relaxation. The ex¬ 
perience of five years of struggle in building up a ropoi*ting system 
and in fully understanding the strategic role which it played in manage¬ 
ment was not lost. It was considered to be good policy for the WPB 
to keep its “steering gear and brakes in good condition,” a function 
which could be performed by the maintenance of minimum reporting. 

Preparations made on the basis of these principles resulted in a sub¬ 
stantial reduction of questionnaires upon the arrival of V-E Day and a 
further substantial reduction on V-J Day. The reductions were orderly 
and resulted in the maintenance of reporting instruments which proved 
to be of great value in the post-war reconversion period. 

CENTRAL REVIEW PROBLEMS AND POLICIES 

The core of the central review problem rested in the effort to satisfy 
all competing pressures present at the point of central review and at 
the same time gradually raise technical levels of statistical proficiency 
and sound operating procedures. Each data request originally had to 
be reviewed not only as a technical and professional problem but also 
in the light of a wide variety of complex infiuences growing out of the 
nature of the WPB organization, the personalities of administrators, 
the status of various control mechanisms, the degree of statistical co¬ 
ordination present, the urgency of data requirements, present and pro¬ 
spective control policy, industry's ability and willingness to comply, 
the nature of other comparable questionnaires and literally dozens of 
other such political, administrative, and technical issues. And once a 
decision was made it could not be forgotten. Review of past actions was 
a necessary accompaniment to original control, for the WPB divisions 
often maintained and supported questionnaires whose usefulness had 
ended or whose eflSciency could be improved by the application of newly 
tested technical methods. 

Fundamental in the policy of form clearance was the idea that better 
results would be obtained in the long run if emphasis wore laid not so 
much upon rigid adherence to fixed procedures as upon the personalities 
concerned with questionnaire formulation and data use. WPB's per¬ 
sonnel was a heterogeneous group of administrators, with varying de¬ 
grees of background in methods of data collection and use, and with 
substantial authority and responsibility in the management of in¬ 
dividual products and materials. It was natural for these individuals 
to insist that, upon the basis of their knowledge of their own indus¬ 
try and war needs, a survey developed by them not only was necessary 
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but also administratively practical. It was hard for central clearance 
to argue against the feeling that “it is necessary to have this informa¬ 
tion in this way to fulfill the responsibilities of this office.” Beconcilia- 
tion of such ideas with technical standards and industry’s ability to 
comply was often not easy. 

The approach to such questions was conciliatory. The goal was to 
nurture acceptance of the idea that questionnaire clearance existed to 
aid administrators to solve reporting problems, to harmonize their 
methods with others in the WPB, and to utilize experience of others in 
making the solution of their problems easier. It was a policy of aid 
rather than dictation. 

Such policies were effective. They generally engendered a feeling of 
reason into discussions of problems which otherwise might have been 
settled arbitrarily with resulting appeals to higher authority and bitter 
conflict. 

Neither administrators nor industry advisory groups were always 
thoroughly familiar with the record-keeping practices of industry from 
which data requests had to be filled. There were many instances of 
violent critical reaction from industry to forms which had been ap¬ 
proved by the WPB and industry advisory committees. In some in¬ 
stances forms had to be withdrawn because industry was in no portion 
to respond. In others, special “Task Groups” had to be established to 
simplify questionnaires before the great majority of re^ondents could 
be expected to comply. There were many instances in which those who 
should have had knowledge of their industry’s ability to file reported 
information were responsible for data requests which could not be filed. 
Often, against the limited knowledge of sponsors of questioimaires, 
clearance had to array information concerning the ability of industry 
to comply. 

In addition, of course, wore the problems of matching the proposed 
questionnaire against technical standards, acceptable statistical 
methodology, the WPB’s control policies, and sound administrative 
procedure. From the beginning it was accepted practice in the clearance 
function to assume that those responsible for policy and administrative 
action should themselves define their information needs. In view of this 
policy, form clearance devoted itself to solving those problems which 
might jeopardize the collection, tabulation, and use of required sta¬ 
tistics. But this was not always easy and the line between conciliation 
and dictation often was blurred. 

Important in the functioning of centralized form clearance was the 
adoption of technical statistical standards and methods to current 
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questioimaire problems. The form clearance problem was one of 
watching experience throughout the WPB and applying that experience 
to the acquisition of information needed for management. The history 
of form clearance in this regard was an uphill climb in instituting ac¬ 
cepted methods in connection with the use of “cut-off” points, stand¬ 
ardization of forms applicable to general orders, geaiing data requests 
to industry record-keeping practices, eliminating “interesting figures” 
not essential for the purposes of the questioimaire, and generating a 
freer fiow of information from one division to another to avoid duplica¬ 
tion and overlappii^ of data requests. Many desirable technical meth¬ 
ods could not be introduced arbitrarily but through a process of edu¬ 
cation their incorporation resulted in a gradual improvement in ques- 
tioniudres. As experience brought with it assurance of lightfulness 
there were many technical standards applicable to surveys and ques¬ 
tionnaires that had to be met before form issuance could be approved. 

Relating reporting to policy raised a dual problem. On the one hand, 
when a policy was established which resulted in questionnaires with 
which industry could not comply an effort had to be made to alter the 
policy. On the other hand, when a policy was established and a ques¬ 
tionnaire contradicted that policy the questionnaire had to be modified 
or rejected. Implications of these problems were far-reaching. 

Policies connected with early individual allocation systems illustrate 
the first type of problem. As discussed previously, most of the early 
allocation systems relied heavily upon acquisition of information which 
it was often exceedingly difficult and sometimes impossible for indus¬ 
try to report. The problem was attacked in a number of ways. First, 
an effort was made to devdop in the WPB an understanffing of the dif¬ 
ficulties in industiy in reporting end-use information. Increaring under¬ 
standing eventually led to requests for types of end-use information 
generally available in the records of respondents. As a result, many 
so-called end-use systems developed into use systems. In many areas, 
however, as noted elsewhere, such results were not achieved. Second, an 
effort was made to prevent the rise of numerous new allotment systems 
which conflicted with those in use. Although form clearance did not 
have authority to correct or initiate fundamental policy it did have 
authority to disapprove new questionnaires. Judicious exerdse of this 
power resulted in discouraging many new allotment schemes. Third, 
strenuous efforts were made to eliminate duplication and overlapping 
of reporting by the modification of existing questionnaires supporting 
the systems. The eventual effect was to crystallize control policy and 
coordinate many individual allocation ^srstems into workable patterns. 
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I'yiiig reporting to established policy was often more difficult when 
basic policy was itself confused and overlapping. In such circumstances, 
relating questionnaires to policy was not a simple undertaking. The 
coterminous existence in 1942 of the Production Requirements Plan 
and the many individual allocation ^sterns is a case in point. Consid¬ 
erable conffict and duplicationexisted among these systems and provided 
a fertile field for form clearance to launch a unification program. Effec¬ 
tive corrective measures, however, were impossible in the absence of 
cleaiwiut top policy. Means for taking some corrective measures came 
only with clarification of basic metal allocation policy upon the an¬ 
nouncement of the Controlled Materials Plan. 

Developments relatii^ to scheduling of components provide another 
illustration of the way in which reporting standards were sometimes 
subordinated and thwarted by ill-defined basic policy. Before the 
adoption of M-293 (the Ceneral Scheduling Order) various WPB divi¬ 
sions had instituted action to request manufacturers of fabricated com¬ 
ponents to report their individual orders. The purpose of such ques¬ 
tionnaires was to place the WPB in a position to direct shipment of a 
specific item to a specific customer. Such a procedure not only was con¬ 
trary to the basic policy of the priority system as it then existed but 
necessitated more personnel to handle the mass of data which would be 
collected than the divisions possessed. In addition, any benefits which 
might have been obtained were small when compared with the tre¬ 
mendous reporting burden such questionnaires would have created for 
industry. With few exceptions, these requests were therefore disap¬ 
proved. 

Issuance of M-293 early in 1943 altered WPB basic policy and also 
clearance policy with respect to scheduling forms. Central clearance 
recognized the fact that the mass of information required by M-293 
could hardly be used in a fashion to justify the heavy reporting burden 
imposed on industry. But central clearance, in the face of firm WPB 
policy, was helpless to disapprove the questionnaires for such reasons. 

Estimates of the impact of WPB questionnaires on industry were 
always considered in the review of questionnaires. The establishment 
of committees to study the problem has already been discussed. Be¬ 
yond the problem of eliminating needless questionnaires lay Ihe more 
intricate problem of adapting questionnaires themselves to fit industry 
records. Interest in this problem was not entirely the result of concern 
for industry. By facilitating industrial response the WPB could obtain 
more accurate, complete, and prompt re^onse to data requests. 

Attention had to be given to programs of other war agencies and the 
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utilization of other agencies in both the mailing and tabulating of ques¬ 
tionnaires. The clearance group sought to coordinate, directly and 
through the Bureau of the Budget, the WPB reporting program with 
that of other Federal agencies. In addition, arrangements were made 
with the Bureau of Mines, the Bureau of Labor Statistics, the Bureau 
of the Census, and other agencies to modify their regular reports to 
meet special needs of the WPB. In some cases, WPB orders made man¬ 
datory the filing of another agency’s report since that was the easiest 
method to obtain necessary information. 

Of much importance in the WPB’s operations was the use of other 
agencies in collecting and tabulating WPB reports. These agencies 
made available to the WPB technical skills in coding, editing, and tabu¬ 
lation questionnaires. Arrangements for acquiring these services were 
made through the clearance group. 
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Oscar Krisen Buros 
Rutgers University 


Experimental Designs in Sociological Research. F. Stuart Chapin (Professor of 
Sociology, Chairman of the Department, and Director of the School of Social 
Work, University of Minnesota, Minneapolis, Minn.). New York 16: Harper & 
Brothers (49 East 33rd St.). 1947. Pp. xi, 206. $3.00. 

Review by O. Kempthornb 
AsBociate Professor of Statistics^ Iowa State College 
Ames, Iowa 

I N the introduction the author states “the newly developed research meth¬ 
ods of experimental design, applied in the community situation, seem to 
point the way by which we can overcome, in the course of time, some of the 
complexities of social interaction that have hitherto baffled rational and 
objective description of certain acute problems of human relations.” There 
can be no disagreement with this statement; and, in fact, one could well take 
the viewpoint that until the general methods of experimental design as 
applied, for example, to biological research are part of the approach to the 
investigation of social situations very little progress can be made. It is un¬ 
fortunate then that the author has gathered together in this book, as note¬ 
worthy examples of the use of experimental designs^in sociological research, 
nine studies, only one of which would satisfy the elementary basic require¬ 
ments of a modern experiment. This statement will be dealt with later in the 
review. First, however, it would be well for the reviewer to state that he 
realizes the difficulties which arise in any attempt to apply to sociological 
research the principles of experimental design as laid down for example by 
R. A. Fisher in The Design of Experiments, It is very easy for the agronomist 
to divide his experimental area into plots to which treatments are applied 
at random, whereas the division of a community into several parts for ran¬ 
dom application of sociological treatments, such as varying types of housing, 
is extremely difficult. It is possible to visualize the application of sociological 
treatments to communities selected at random, though the pressure of public 
opinion may prevent research workers from adopting this procedure for any 
important problems. Professor Chapin discusses these difficulties at some 
length throughout the book, and the biologist or agronomist can only feel 
happy that his problems are so much simpler. 

The chapters of the book are entitled: (1) Natural Social Experiments by 
Trial and Error; (2) Three Experimental Designs for Controlled Observation; 
(3) Cross-Sectional Experimental Design; (4) Projected Experimental De¬ 
sign: The Classical Pattern of “Before” and “After” Experiments That Oper- 
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ate from the Present to the Future; (5) Ex Post Facto Experimental De¬ 
sign: From Present to Past; (6) Sociometric Scales Available for Control 
and the Measurement of Effects; and (7) Some Fundamental Problems and 
Limitations to Study by Experimental Designs. This is an imposing array 
of chapter headings with four different types of experiments mentioned, 
natural, cross-sectional, projected, and ex post facto. The question arises 
immediately of what the author understands by the words “experiment” 
and “e3q)erimental design.” The word “experiment” has been used presum¬ 
ably for centuries to describe the taMng of observations according to some 
plan, though in the biological sciences it now connotes the application and 
comparison of various different treatments or technical procedures. The 
necessity for clear thinking on this point has been emphasized in a recent 
paper* where the author differentiates between two types of experiment, 
the one for example to measure an absolute constant such as the velocity 
of light and the other for example to evaluate the effect of a special dietary 
supplement. The usage of the word “experimental design” is well established 
by now to mean a plan for performing a comparative experiment. This implies 
that various treatments are actually applied by the investigator and are not 
just treatments that happened to have been applied to particular units 
for some reason, known or unknown, before the “experiment” was planned. 
This condition rules out practically all of the experiments and experimental 
designs discussed by the author. This so-called “natural” experiment should 
be described as a phenomenon and has little inductive value; for it is well 
established that the same set of environmental conditions will not produce 
the same effect in all cases. The “cross-sectional” designs discussed have a 
superficial appearance of validity, and would be completely valid if (and only 
if) a randomization procedure had been adopted in assigning treatments. 
The fact that pairs of individuals have been matched on several control 
factors does not mean that they are identical: matching of experimental 
units is the normal procedure in all types of experiment, but is useless 
without randomization and replication by which the closeness of the match¬ 
ing may be estimated. Experimental errors are frequently quoted in the 
book when there is neither randomization nor replication of the treatments. 
The author appears (Chap. 7 and elsewhere) to be aware of these considera¬ 
tions, which if given their proper weight would have resulted in the exclusion 
of practically all the studies on the groimds that they are examples of the 
misuse of basic concepts and are incapable of unbiassed interpretation. The 
chapter on “Projected Experimental Design” contains the only one of the 
nine studies in which the principle of randomization was used, namely Dr. 
Eeuben Hill’s study on “Staff Stimulation to Social Participation and Social 
Adjustment.” This study was not, however, entirely satisfactory because of 
the population of 306 freshmen considered, there were only 171 for whom 
complete records were available. It appears that what the author terms a 
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projected design is satisfactory if the concepts of randomization and replica¬ 
tion are used properly. The reviewer fails to understand how the “Ex Post 
Facto Experimental Design” can be given a logical justification. An example 
of this “design” was the determination of the relationship of public housing 
in New Haven to juvenile delinquency, in which the earher records of 317 
families who had lived for a few years in one of the Developments of the 
New Haven Housing Authority were compared to their records during these 
few years. Some of the difficulties in the interpretation of such data are 
obvious to anyone experienced in the analysis of data and there is no need 
to discuss them. 

There appears to be some confusion in the statistical literature on the 
differences between experimental designs and survey designs. Both involve 
patterns by which observations are taken, but in experimental designs the 
objective is to estimate the effects of applied treatments and in survey work 
the objective is to estimate certain characteristics for a particular defined 
population. There are possibilities of using experimental design patterns in 
survey work, as for example the use of deep stratification with a Latin 
Square. Again, if the purpose of the survey is to establish the existence of 
associations in the population, a sample that is balanced on some factors may 
be useful. Causal relationships are frequently inferred from such associations, 
but they cannot be regarded as having the sound scientific basis of inferences 
drawn from experimental data. 

Apart from the above objections which are in themselves sufficient to 
make the book unreliable if not dangerous for workers in social sciences, 
there are very many instances of imprecise thinking and writing. Two ex¬ 
amples are sufficient evidence: “The conventional tests of the significance of 
sampling are based upon the theory of random samples, and in the present 
stage of experimental work, as we have attempted to show, it is the terminal 
homogeneity and purity in the sample, rather than initial representativeness 
of heterogeneity, that is important in demonstrating the real relationship 
between the two variable factors” (p. 104), and “Furthermore, since the 
critical ratio of 2.90 is computed from a difference between extremely pure 
samples, its significance is much enhanced by this fact” (p. 107). (The “criti¬ 
cal ratio” appears to be Student's “t”.) 

The author in spite of numerous references and discussions of the question 
does not appear to understand the concepts of prediction and causation. 
Mathematical economists have for some time differentiated between the two 
problems of predicting events and estimating structural relationships. The 
interpretation of correlation coefficients appears to the reviewer to be par¬ 
ticularly vague. 

Further general criticisms of the book are that the author takes as the 
fundamental rule of the experimental method the varying of only one condi¬ 
tion at a time with all other factors maintained constant (p. 1), and there is 
lack of clear discussion of the formulation of hypotheses in an unambiguous 
way before looking at the experimental data, this formulation of course in- 
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eluding the definition of some population for which the conclusions are to be 
relevant. 


Principles of Medical Statisticsi Fourth Edition. A, Bradford Hill (Professor of 
Medical Statistics and Director of the Department, London School of Hygiene 
and Tropical Medicine, University of London, London, England). London 
W.C.2: Lancet Ltd. (7 Adams St., Adelpbia), 1948. Pp. xi, 252. 10s. 6d. 

Review by J. Yerushalmt 
Professor of Biostatisticsj University of Cedifomia 
Berkeley, CcMfornia 

M any workers in clinical and preventive medicine have found the previous 
editions of this book the most satisfactory source for acquiring a work¬ 
ing knowledge in statistics and an understanding of the role of quantitative 
methods in medicine and public health. The publication of the fourth edition 
is evidence at once that the demand for knowledge in this field is present in 
an increasing number of members of the medical professions and that this 
book is singularly effective in satisfying this demand. 

Of greatest interest to professional statisticians are the techniques of 
presentation which accomplish the difficult task of making statistical meth¬ 
ods palatable to a nonmathematical audience. These techniques, in the 
opinion of this reviewer, derive their effectiveness from a sound evaluation 
by the author of the intellectual level of the readers for whom the book is 
intended. It is apparent that Professor Hill addresses himself to an audience 
of intelligent and capable persons in their own fields of endeavor, even if they 
are lacking in mathematical training. Consequently he does not “talk down” 
to them; rather he introduces the statistical concepts in the framework of 
medicine and public health with which his readers are familiar. He thus 
succeeds in developing the statistical techniques without the use of mathe¬ 
matical symbols and formulae in such a way that the rationale behind them 
is comprehended. The concepts become acceptable as logical entities although 
no mathematical derivations are presented. 

One minor exception to the generally lucid and comprehensive method of 
presentation may be mentioned. In discussing the standard error of the 
difference between two sample proportions Professor Hill presents the usual 
two formulae, one in which an estimate of the universe proportion is ob¬ 
tained from a combination of the two samples, the other in which individual 
proportions of the two samples are used in the determination of the standard 
error of the difference. The explanation of the later technique, however, is 
presented in terms of two different estimates of the standard error of the 
proportion in the universe (pp. 111-112). It would probably be more easily 
acceptable to the reader if the hypothesis of a single universe was not retained 
in the second approach. 
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In this fourth edition Professor Hill has substantially elaborated and 
strengthened many of the weaker points of the previous editions. Tabular 
and graphic presentation and the description and calculations of averages 
is more adequately covered. Sampling is discussed more adequately and the 
concept of normality is presented more clearly. The development of the life 
table and its functions is particularly lucid. The chapters on sampling, aver¬ 
ages, proportions, differences, and chi square are enriched by numerous 
examples, many of them in the field of public health. There are also additions 
in the number of the very instructive examples of fallacies, difficulties, and 
problems of selection. 

In spite of the additions and elaborations of so many topics, Professor Hill 
succeeded nevertheless in his objective to have the book “kept within 
limited bounds and confined to ‘arithmetic guided by logic’.” 


The Teaching of Statistics: A Report of the Institute of Mathematical Statistics 
Committee on the Teaching of Statistics. Harold HoteUing, Chairman (Professor 
of Mathematical Statistics and Head of the Department; Associate Director, 
Institute of Statistics; University of North Carolina), Walter Earthy (Dean of the 
Division of the Physical Sciences and Professor of Applied Mathematics, Univer¬ 
sity of Chicago, Chicago, Ill.), W, Edwards Deming (Adviser in Sampling, Bureau 
of the Budget, Washiagton, D. C.), Milton Friedman (Associate Professor of 
Economics, University of Chicago, Chicago, Ill.), and Paid Eoel (Associate Pro¬ 
fessor of Mathematics, University of California, Los Angeles, Cahf.). Reprinted 
from Ann Math Stat 19(1): 95-115 Je ’47. Washington 25, D. C.: National Re¬ 
search Council (2101 Constitution Ave.}, 1948. Pp. 95-115. Paper. 

The Teaching of Statistics in Universities and Colleges. Based upon a report 
by the Royal Statistical Society’s Committee on the teaching of statistics: E, S, 
Pearson (Chairman), R. G, D. Allen, H, Campion, WiUiam Elderion, C. Oswald 
George, R. F. George, M, Greenwood, D. Heron, J. 0, Irwin, H, Leak, E, (7. 
Rhodes, R. Stone, L, H. C. Tippett, ajidJ. Wishart. Preprint of Royal Stat Soe 
110(1): 51-7 pt 1 ’47. London W.C.2: Royal Statistical Society (4 Portugal St.), 
1947. Pp. 7. Paper. 6d. 

Reviewed by Tbuman L. 

Professor of Education, Harvard University 
Cambridge, Massachusetts 

THESE reports cover the same ground, it is advantageous to compare 
their points of view and recommendations. The British report is the 
more realistic in that it starts with the status quo in the United Kingdom 
and Eire, where the teaching of statistics is not nearly as extensive as in the 
United States, and makes recommendations which may, perhaps, be at¬ 
tainable under the confiicting academic pressures. The American Report is 
idealistic. It states with conviction and well-documented argument what 
ou^t to be the situation, but it pays scant attention to the sundry historical 
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ori^BS of statistical development and teaching, or to the multitudinous 
reasons for specialization and differentiation. 

The British report states that "There are few more urgent needs today 
than the increase of the supply of frst class statisticians,” and the American 
report repeatedly expresses the same idea. 

The British committee asserts that social science workers require a knowl¬ 
edge of "descriptiye statistics.” There is no specific reference to this initial 
and elementary need in the American report. This matter is of importance, 
for the American committee believes that a knowledge of basic statistical 
principles is a part of a liberal education which all college students should 
get. This basic need is acquaintance "with the broad principles of inductive 
inference.” It seems to the reviewer that there is implicit in the British recom¬ 
mendation an induction of the student into statistics via the subject matter 
of his field of spedalization, and in the American an induction via logic, 
including principles of mathematics and probability. It is needless to say 
that these approaches are far asunder. By the one approach the student 
is initially aware of virginal data and with later study acquires the power of 
correctly appraising it, while the other approach would see that the student 
first acquires the powers and the tools for sound appraisal and then applies 
them to the field of his choice. 

This second approach seems necessary if all students, irrespective of their 
variety of interests, become acquainted with statistics through a general 
and comprehensive orientation course. Both committees favor the inclusion 
of statistics (of a sort not clearly defined by the British committee) as a 
minor part of a universal basic course. 

Both committees believe, as stated by the British committee, that: the 
statistical approach is so fundamental to the modem way of looking at 
things . .. that it should form a part of the mental equipment of the edu¬ 
cated man ...” This committee adds, with refreshing realism and modesty, 
that "... we do not propose at present that the universities should attempt 
to fill this gap in general education ...” However, both committees attack 
the problem, for they propose statistical courses, and the American commit¬ 
tee goes into detail in recommending the type of instructor that is needed. 

At the lowest level the British committee advocates a general course 
to familiarize students with statistical ideas and the most commonly used 
elementary methods, and the American committee asserts that "the funda¬ 
mental logic and philosophy of statistics can be taught at an early stage.” 
This is to include the broad principles of inductive inference, concepts of 
sampling variation, randomness, and statistical predictability, the difference 
between inductive and deductive statements, the nature of statistical esti¬ 
mation, and the nature of statistical hypothesis. As a teacher who has tried 
a variety of approaches, the reviewer recommends one which is not clearly 
within the scope of dther of these committee recommendations—^it is via 
data already within the field of interest and acquaintance of the student. 
The phenomena of social and scientific experience constitute the starting 
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point. Their need for interpretation through sound logic is so obvious and so 
personally felt that there is no sense of unreality and no lack in motivation. 
Because of the variety of interest^ a single introductory course does not suffice 
so, contingent upon sufficient registrants, there may well be introductory 
courses in statistics in biology, agriculture, economics, education and psy¬ 
chology, physical sciences, etc. Such an approach is vigorously castigated by 
the American committee, which advocates an introductory course under the 
aegis of the department of mathematics (expanded, to be sure, to include an 
expert statistician) and which even goes so far as to imply that the present 
situation wherein many departmental introductory courses are given by 
semi-trained statisticians is worse than if no courses at all were offered. 

Both committees admit the paucity of adequately trained teacher person- 
nd. In the light of it we must, as a practical matter, consider the benefits 
derivable from a course given, say, in the college department of economics 
by (a) a well-trained mathematician and statistician who is quite ignorant of 
economic issues, (6) a well-trained economist who is quite ignorant of statis¬ 
tical niceties, or (c) one semitrained in both statistics and in economics. The 
reviewer observes that situations change and that with national standards 
in the various disciplines set by such men as Fisher, Hotelling, Pearson, etc. 
a second giving of a course by an (a), a (6), or a (c) will migrate from a first 
^ving in a commendable direction. We can say that two decades of American 
procedure that has been contrary to that endorsed by the American com¬ 
mittee has resulted in a truly remarkable advance in experimental and ap¬ 
plied statistics. The British committee implies what the decades have 
proven, that high competence in statistics has not been dependent upon 
college courses in statistics. Such courses certainly should broaden the base 
from which individual genius can rise to an apex, but let us be wary of a 
definition of path which attempts to fix the initial steps or the terminal 
product. 

The reviewer gathers that the American committee pictures statistical 
training through the following sequence: (1) a first course, or part of a course, 
for all students, given by a mathematical statistician; (2) a course for “fut¬ 
ure consumers” given by an applied statistician; (3) a course for “future users 
of statistical methods” which is in part mathematical statistics and in part 
applied statistics; and (4) a course for “future research workers and teach¬ 
ers ... ,” given under the original parental roof of mathematical statistics. 
The British recommendations are not so specific, but they write, whether 
with approval or regret is not clear, that “it seems inevitable that there 
should be separate courses for... students specializing in various sciences.” 
They also say that “the necessity, at this level, of supplementing lectures by 
practical work cannot be too strongly emphasized.” The necessity for a 
number of centers for advanced courses is emphasized. The sort of advanced 
course here implied seems quite similar to number (4) of the American com¬ 
mittee scheme, for they write “the basis of statistical theory and its ally the 
theory of probability is essentially mathematical.” 
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The American committee, by omission and by inclusion, reveals what it 
considers to be preparatory background for students of statistics. It at no 
point cites knowledge of data in some scientific field as essential. It urges 
knowledge of the calculus—^making the happy suggestion that it is appro¬ 
priate for high school study—^but fails to emphasize matrix and other ad¬ 
vanced algebra which are much more important for statistics. In the judg¬ 
ment of the reviewer, its greatest failure is in not building upon the usual 
normal psychological course of student development—(a) knowledge of 
general afiairs, including a smattering of mathematiGS, (&) a developing 
interest in some scientific field, (c) an epochal experience when data are dis¬ 
covered to be an original source of new knowledge, (d) a determination to 
utilize this source and to acquaint oneself with the techniques for making it 
reveal its hidden information. Not until this last is tapped can we expect 
that men wiU devote themselves to the training necessary to become great 
statisticians and experimentalists. The reviewer believes that the end prod- 
duct in the teaching of statistics should transcend, if not in height certainly 
in breadth, that implied in the term "mathematical statistician.” 

The American committee deplores the general lack of mathematical com¬ 
petence of most teachers of statistics in different subject matter fields. This 
is deplorable as is their lack of knowledge of the genius of data in their fields. 
However, the progress of recent decades should make one optimistic, and 
these two committee reports should encourage college presidents to strengthen 
and broaden the instruction in both mathematical and applied statistics. 


Nomography. A. S. Levens (Associate Professor of Mechanical Engineermg> 
University of California, Berkeley, Calif.) New York 16: John Wiley & Sons, Inc. 
(440 Fourth Ave.), 1948. Pp. viii, 176. 63.00. 

Eevibw by M. Kac 

Associate Professor of MathemalicSf Cornell University 
Ithaca, New York 

T his book is devoted to an elementary presentation of the basic principles 
used in design of nomograms (alignment charts). The material is stand¬ 
ard but attractively presented and illustrated by numerous examples. 

The mathematical prerequisites are modest and consist of no more than 
elementary algebra (through logarithms) and plane geometry. 

The book is an outgrowth of nomography courses for engineering students 
and it is therefore only natural to find most of the applications and illustra¬ 
tions drawn from that field. There is no doubt that nomograms are useful as 
a computational aid in many engineering problems. The reviewer doubts 
however, that nomography will prove very useful to statisticians. In fact, 
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out of thirty-odd nomograms collected in the Appendix and which in the 
words of the author “may prove useful in the fields of engineering, produc¬ 
tion, business, and statistics” there are only two taken from the field of 
statistics. These are a nomogram for the standard deviation (p. 170). 



and a nomogram for the correlation coefficient (p. 171) 


^xy 

r sa—=- —• 

Both of these nomograms can be used only after Sx®, S^/®, "Zxy have been 
calculated. Since the computation of Sa;®, Sy®, Xxy is by far the most labo¬ 
rious part, it seems a little naive to expect that the above nomograms will 
introduce a significant saving of time. 


Foundations of Economic Analysis. Paul Anthony Samuelson (Professor of Eco¬ 
nomics, Massachusetts Institute of Technology, Cambridge, Mass.). Harvard 
Economic Studies, Vol. 80. Cambridge 38, Mass.: Harvard University Press 
(38 Quincy Street), 1947. Pp. xii, 447. $7.50. 

Review by Gerhard Tintner 
Research Associate, Department of Applied Economics 
Cambridge University, Cambridge, England 
ON leave: Professor of Economics and Mathematics 
Iowa State College, Ames, Iowa 

T his is a very important contribution in the field of pure economics which 
ought to be of interest to econometricians and those statisticians who con¬ 
cern themselves with economic matters. 

The author endeavors throughout to derive “operationally meaningful” 
propositions in economics from the assumption of maximizing behavior. 
Operationally meaningful statements are hypotheses about empirical data 
which could conceivably be refuted, if only under ideal conditions. These 
ideas are related to those of some modern positivist philosophers, especially 
Bridgeman. It is perhaps unfortunate that the author has not included a 
methodological chapter in his book. An explicit statement of his philosophical 
ideas may have contributed to the understanding of his book. It may have 
helped some economists who are sceptical about the applicability of radical 
positivism and behaviorism in the social sciences. 
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The first part of the book is concerned with economic statics. It also in¬ 
cludes a chapter on welfare economics, which seems a little out of place. 
The theory of maxima in all its ramifications is treated at great length. There 
is also a more systematic mathematical appendix on this subject. 

On the basis of a very general approach the theory of cost and production 
and also the theory of consumer behavior is ably stated. An interesting 
feature in the treatment is the discussion of maxima if the functions in ques¬ 
tion are not continuous. This is a very important advance, though the dis¬ 
continuities considered may not be the most important ones met with in 
economic life. 

Monopoly and monopolistic competition get very little attention, to say 
nothing of more complicated situations like duopoly and bilateral monopoly. 
This narrowness of the point of view makes the book a little less useful than 
it might have been. 

A short section on the economic theory of index numbers (pp. 146-156) 
should be of particular interest to the statistician. It is probably the best 
short statement available about the modem theory of cost of living index 
numbers- 

A very interesting chapter on welfare economics concludes this first section 
of the book. In its brief compass it contains most of the important new results 
of the modern theory in this field. 

The second part of the book deals with economic dynamics. But the dis¬ 
cussion is almost entirely based upon purely formal consideration of equilib¬ 
ria and their relation to comparative statics. The “correspondence” principle 
of the author states that “the problem of stability of equilibrium is inti¬ 
mately tied up with the problem of deriving fruitful theorems in comparative 
statics” (p. 258). The author succeeds in reaching very interesting conclu¬ 
sions in this way. His work is inspired by the modern theory of dynamics (in 
physics), especially by the work of George D. BirkhofP (Dynamical Systems, 
1927). A mathematical appendix deals with the theory of difference and 
other functional equations. The examples given in this section of the book 
include also a very interesting “dynamization” of the Keynesian system. 
But it appears here and elsewhere that not many definite conclusions can be 
drawn, even after making a number of somewhat arbitrary assumptions. 

The second part of the book is hence somewhat disappointing to the econ¬ 
omist. It is evident that not many valid and interesting conclusions can be 
drawn from such narrow assumptions as made by the author. The whole 
theory of anticipations and expectations is either neglected or appears only 
incidentally. Uncertainty hardly gets an adequate treatment in the small 
space allotted to it. Samuelson's book is in this respect inferior to the earlier 
work of J. R. Hicks (Value and Cajntal, 1939) and others who have en¬ 
deavored to deal with problems of economic risk and uncertainty. I believe 
that especially the theory of formation of anticipations forms a true fink 
between static economics and the more useful and interesting economic 
dynamics. 
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Chapter 11 deals with the classification of equilibria. This discussion is 
again based upon modern (physical) d 3 mamics. The usefulness of the dis¬ 
tinctions made in economics is not immediately apparent. The subsequent 
discussion of the business cycle is very short and not on the high level main¬ 
tained in the earlier parts of the book. 

The author devotes towards the end of the book a few pages to the theory 
of stochastic systems. The treatment is much too brief to yield important 
results. Some of the ideas presented may however prove stimulating to the 
statisticians working in this field, especially the indications about nonlinear 
stochastic systems. 

In summary, the book ought to be recommended very highly to the mature 
reader who possesses an adequate knowledge of modern economic theory and 
higher mathematics. A conclusion which he may draw from its contents is 
the following: Assuming only maximizing behavior (something like the 
"economic principle” of the elementary text books) very few interesting 
conclusions about economic behavior can be drawn even by so brilliant a 
theoretician as Samuelson. There is a great deal of valid and important 
economic theory contained in his book. The reader might be inspired by the 
comparative failure of pure economics to devote his interest to a more prom¬ 
ising approach. Econometric investigations combine modem economic 
theory, which is so ably presented in this book, with modern statistical 
methods in order to derive valid empirical conclusions. 
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COMMERCIAL USES OF SAMPLING* 

J. Stevens Stock anb Joseph R. Hochstim 
Opinion Research Corporation^ Princeton, New Jersey 

To illustrate how a commercial research agency meets its 
sampling problems, eight surveys of widely varying types are 
briefly described. 


I 

T he purpose of this paper is to report on some of the commercial 
uses of sampling. We shall describe a series of actual examples which 
show how a commercial research agency meets its sampling problems. 
As in all sampling problems, the nature of the research objective and 
the facilities at hand are the all-important conditions which determine 
the approach. 

1. Most commercial agencies must be prepared to tackle an extremely wide 
variety of sampling problems. No one sampling procedure or principle is 
applicable in aU, or even in a majority of, cases. 

2. The actual budget is frequently not always commensurate with the diffi¬ 
culty of the problem. Often it is necessary to do the best job possible un¬ 
der a very restrictive budget, even though a larger budget would allow for 
much greater dollar efficiency. 

3. Most commercial research problems are geared into current-policy de¬ 
termination. Consequently, research efficiency must sometimes be sacri- 
flced for speed. 

4. Although there are important exceptions, commercial research problems 
are for the most part nonrecurrent. Time series and repetitive studies are 
conducted, but are relatively small in number. 

5. Many research problems must be tackled with limited source material 
because of the specialized nature of the problems. 

6. Many commercial research studies must cover unusual, difficult-to-define 
universes of people, such as the purchasers of a certain commodity, all 
chemists in the United States, all people who wiU vote in the next election, 
etc. 


* Presented at the 25th Session of the International Statistical Institute, a constituent part of the 
International Statistical CSonferences, Wadiington, D. C, September 6-18,1947. This paper ^vill also 
appear in the Proceedinss of the Conferences, whidti will be published in the near future. 
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7. Commercial research agencies often find it necessary to study attitudes 
and opinions which have highly emotional, personal, or morid implica¬ 
tions in the respondent's mind. 

8. Very often, even in large national studies, the commercial research agency 
must make detailed analyses for relatively smaU areas within the country. 
This makes it necessary to maintain extremely large part-time staffs 
widely distributed throughout the nation. 

Eegardless of any pet theories, the sampler has to consider the fol¬ 
lowing: 

1. The problems the user of the data needs to have answered. 

2. The actions likely to be based on the results of the survey. 

3. The degree of precision required. 

4. The size of the budget. 

In any case, the sample design chosen ^ould provide the user of the 
data with Tnfl.TiTniiTn information at •minimum cost. 

n 

Our first example involves the sampling of purchasers of alcoholic 
beverages. It is illustrative of the quick, nonrecurrent survey of a 
difficult-to-define universe on a subject that for some respondents has 
moral, personal, and political implications. In one particular state a 
law had been passed increaring the liquor tax. The client, a distilling 
company, wanted to know who was getting the blame from purchasers 
of hard liquor for the increase in prices—^the state government, the 
distillers, or the distributors. The answer to this question would deter¬ 
mine the direction the client’s public-relations effort would take and 
the issues on which it would concentrate. 

Early testing of the questionnaire indicated that people were imable 
or unwilling to give accurate answers as to their purchases of alcoholic 
beverages, except at the time of purchase. The client, however, was 
interested in directing his public-relations effort at the purchasers of 
his product so that it would have the greatest impact. It was necessary, 
therefore, to design the sample so that the number of respondents was 
approximately proportionate to the amount of total liquor sales 
•throughout the universe. 

In the state baing surveyed packaged alcoholic beverages are sold 
only in state liquor stores. Data on liquor sales were available for the 
past year for different districts of the state, for indi'vidual towns in 
each coiml^, and for indi-vidual stores. The sample was derigned as 
follows: 

The number of intervie^ws -with purchasers in the various districts 
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was computed in proportion to the amount of liquor sales in each dis¬ 
trict. The two largest cities of the state were removed from thdr dis¬ 
tricts and treated as separate units. Interviews were allocated to these 
two cities separately according to the amount of their own liquor sales. 

Each district was divided into three groups of counties: those with 
high per-capita consumption, those with medium, and those with low 
per-capita consumption. 

'Within each of these three groups of counties the towns were strati¬ 
fied by per-capita volume of liquor sales. One sample town was selected 
from each substratum. The number of interviews with liquor purchasers 
allocated to each sample town was in proportion to the total volume of 
liquor sales of the substratum from whidi the town was selected. 
Within each town a list of stores was prepared and aiTanged in rank 
order according to volume of liquor sales. Stores were then selected 
within each sample town S3retematically with probability of selection 
proportionate to sales volume. 

Interviewers were instructed to stand outside the stores selected and 
to interview every nth person leaving the stores until their allocated 
number of interviews was complete. In order to assure proper distribu¬ 
tion by time of day and day of the week, interviewers were instructed 
to interview half of their respondents on Friday afternoon and Satur¬ 
day, and the other half on other days of the week. In addition, the in¬ 
terviewers were instructed to fill half their their quotas before 4:00 
p.u. each day and half of them after 4:00 f.m. each day. 

Ill 

Our next example is typical of a study of the community relations of 
an industrial firm. 

The largest employer in a Midwestern town of about 100,000 popular 
tion felt that his company had done a good job of public and industrial 
relations. But in spite of aU its efforts the company had a strike. The 
executives wanted to know very quickly what could be done to correct 
an apparent hostility of the citizenry of the community. The problem 
of the survey was to find what the people in the community were 
thinking about the company and its employees and what the company 
could do in the opinion of the community to make the plant a better 
place to work. 

The 1940 Census Block Statistics, which were available for this city, 
were brought up to date in numbers of dwelling units with the aid of 
the city planner’s data on new construction. All blocks were then 



512 AMERICAN STATISTICAL ASSOCIATION JOURNAL DRCEMBBR 1018 

stratified first by census tract, then by average rent. Within each of 
these strata they were arranged by the number of dwelling units per 
block. The sample blocks were selected by systematic random sampling 
with probability of incluaon proportionate to the number of dwelling 
units in each block. 

Within each block the interviewer was given a starting point and 
instructed to go in clockwise direction and to interview every nth 
dwelling unit in the assigned block until the quota for that block was 
filled. 

In cases of noninterview because of refusal or because no one was at 
home, interviewers were told to go to the ne^ dwelling unit, but to 
continue the count from the dwelling unit originally designated. In 
order to minimize the bias resulting from the omission of not-at-homes 
and to ^ve more adequate opportunity for getting employed workers 
in the sample, interviewers were required to do their interviewing in the 
evening and over weekends. 

Quotas covering the whole city by sex and age were given to the 
supervisor. By cumulative daily taUies of the interviews returned, the 
supervisor was able to watch the representativeness of the survey. He 
asagned vaiying quotas to the interviewers for the selection of in¬ 
dividuals within the sample households when certain age-sex groups in 
the population appeared to be under-represented according to the 
over-all quota. 

Note: Becent experience with this combination of areal and quota tech¬ 
niques indicates that much of the interviewer’s tendency toward bias 
is eliminated by the systematic pre-selection of each dwelling unit in 
which an interview is to take place. The use of quotas for the selection 
of an individual within each household allows the interviewer very 
little free choice. 

IV 

The next example is a survey to measure the attitudes of social- 
sdence teachers toward basic principles of free-enterpiise economics. 
The particularly difficult problem in this case was to get a cross section 
of a hi^y specialized population, namdy, college and high-school 
teachers of government, economics, sociology, and history. 

The sample of high-school teachers was allotted by states according 
to data on high-school enrollment published by the Office of Education. 
Interviews were allocated to dties and towns in proportion to the 
population distribution of urban places within each state. The selection 
of the 160 dties and towns within states a-uH dze-of-town groups was 
determined by the location of the company’s resident interviewers. 



COMMERCIAL USES OF SAMPLING 


513 


Note: At first glance, it would seem the selection of towns to suit the con¬ 
venience of the resident interviewers is quite unrigorous. In this case, 
however, it is an excellent means of selecting a random sample of 
towns. Since there is no conceivable correlation between the methods 
by which part-time resident interviewers are recruited and any atti¬ 
tude characteristic of the social-science teachers being studied, we 
can say that we have selected our sample at random. 

It is a common practice in commercial research firms (and others as 
well, we presume) to base the selection of localities for the cross section 
on the location of the resident staff. Many suspect this technique of 
bias. However, as long as the investigator can guarantee that the 
method of recruiting is in no predictable way connected with the 
problem studied, we can be assured that the selection of localities 
is as random as if it had been based on the use of random numbers 
or some game of chance. 

Most commercial research agencies that maintain large part-time 
resident interviewing staffs recruit them in proportion to the distribu¬ 
tion of the population by region, state, and size of town groups within 
states. In this case the total maximum number of resident inter¬ 
viewers available was approximately 1,000 located in more than 600 
places. 

As for college teachers, a complete list of all colleges was prepared by 
type—senior, junior and teacher coEeges—and was arranged by states. 
The number of teachers was recorded for each college, and the colleges 
were put into two groups: those with a teaching staff of 200 or more and 
those with fewer than 200 teachers. Every nth college was selected for 
the sample. 

Since in neither case was there available a total count on any social 
or economic characteristic of teachers of social-science subjects, it was 
impossible to assign any quotas by characteristics. It was necessary 
therefore, to insure a strict randomization of the respondents. Accord¬ 
ingly, interviewers were instructed to obtain a complete list of social- 
science teachers for the sample colleges and all high schools in the 
towns selected. The interviewers were further asked to alphabetize 
this list and to interview the first n teachers starting with a given 
letter of the alphabet. Interviewers were given different starting 
letters. 

A separate part of this survey illustrates the use of mail question¬ 
naires in stud3ring a highly specialized universe. A large industrial 
company had conducted for social-science teachers a coxnrse of several 
weeks in some of the social problems faced by business and the tech¬ 
niques used in solving these problems. The purpose of this study was 
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to measure the impact on teachers of impressions obtained in this 
course. 

A complete list of all participants in the course was obtained. Post 
cards were mailed to participants asking them to state by return mail 
whether they would be willing to answer a mail questionnaire. 

A great majority of the teachers expressed their willingness to answer 
the mail questionnaires. When the returns were counted, however, 
slightly less than 50% of all participants actually mailed questionnaires. 

Since it was felt that the returned sample might not be representative 
and that those teachers who had not returned the mail questionnaires 
might be different in their characteristics and attitudes, it was detided 
to take a separate sample of nonretumers by personal interviews. 

The industrial company that gave the course was located in a very 
large Midwestern city, and about 40% of the teachers attending the 
course lived within an area of 15 miles from the plant city. It was de¬ 
cided that resident interviewers were to telephone all social-science 
teachers included in the oripnal list who lived within a 15-mile radius 
of the plant city. They were to inquire whether the teachers had re¬ 
turned the mail questionnaires and if not, to make an appointment for 
a personal interview. 

By this method, interviews with approximately one fourth of the 
teachers who did not answer the mail questionnaires were secured. It 
was then assumed that those who did not answer the questionnaires and 
who lived in the plant-city area were representative of all the teachers 
not answering the questionnaire. Accordingly, the answers to personal 
interviews were weighted to give proportionate representation to those 
who had, and those who had not, answered the mail questionnaires. 

V 

Much commercial research is devoted to studying the style or prod¬ 
uct preferences of highly specialized groups in the population who are 
actual or potential customers for a certain commodity. The purpose of 
such research is to get rou^ estimates of relative customer preference 
in order to guide manufacturing and sales policy. 

A survey of this kiud was recently conducted to determine style 
preferences for a particular piece of furniture. In order to set up as 
nearly as possible a realistic buying situation, the study was restricted 
to three separate groups: 


1. Wdl-to-do respondents were questioned on styles in lugh-priced items. 
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2. Average and medium-well-to-do respondents were questioned on medium- 
priced items. 

3. Poor respondents were questioned on styles in low priced items. 

To simplify the approach and meet the restrictive budget, interviews 
were conducted only in towns with population of 50,000 or more. All 
towns having 50,000 or more population were stratified by geographic 
sections. Interviews were allocated to cities and towns in proportion 
to their population in groups of 15 or multiples of 15. In the case of 
small cities, this amounted to arraying them by population size and 
selecting an ordered random sample of cities. However, instead of 
selecting every nth city, we selected every nth person, so that the 
probability of selecting any one city was determined by the chance of 
selecting any one person living in the city. Therefore, the probability 
of selecting each city was proportionate to its population size. 

Note: A common practice of commercial research agencies is to assign in¬ 
terviews in groups of 10 or 15, in order to give each interviewer an 
assignment large enough to make it worth his while to work on the 
study and small enough so that he can finish his interviewing in the 
time limits set for the study. 

Within each of the cities selected a relatively large primary sample 
of 100 to 300 blocks was chosen by selecting every nth block from the 
published block statistics. From these large samples of blocks smaller 
samples of blocks were selected for assignment. 

Note: This practice of selectii^ relatively large primary samples of blocks, 
then stratifying them by section of the city, rent, and numbers of 
dwelling units, for further selection of smaller samples of blocks 
amounts to dovbk sampling. It appears to be a very useful practice 
where small samples of blocks from relatively large dties are needed. 
The very practical reason for this kind of double sampling is to save 
on the burdensome clerical time that would be required to stratify 
all the thousands and tens of thousands of blocks in a large city. It 
is necessary only to stratify the primary sample of lOO to 300 blocks. 

Within each city the primary sample of blocks was divided into three 
broad groups: high-rent, medium-rent, and low-rent blocks. In the 
large cities in the sample, where as many as three groups of 15 inter¬ 
views were asagned, three interviewers were assigned to sample blocks 
corresponding to each of the three rental groups. In smaller dties, where 
only one or two sets of 15 interviews were asdgned, interviewers were 
required to interview in sample blocks from (correspondingly) onty 



516 


AlfBBICAN STATISTICAL ASSOCIATION JOTTBNAL DBCEMBNB 1848 


one or two of the rental types. Thus, no attempt was made in smaller 
cities to get a cross section of each city since no individual city anal 3 r 8 es 
were required. 

Note: The leason for this assigmnent procedure was that a different ques¬ 
tionnaire was used for each type of respondent—^the well-to-do, the 
average, and the poor. Because of the complicated nature of the inter¬ 
view and the dai^er of confusing the questionnaires, only one type 
of questionnaire was assigned to each interviewer. 

Blocks within each of the three broad rental groups in each sample 
dty were selected by stratifying the primary sample of blocks for each 
broad rental group into three strata according to rent. From each of 
these three rental substrata, three blocks were selected at random 
with probability of selection in proportion to the number of dwelling 
units on each block. Thus, each interviewer was assigned one type of 
questionnaire in one broad rental group. Further, he was assigned 
three blocks from each of his three rental substrata (9 blocks in all). 
The interviewer was instructed to take five interviews from each set of 
the three blocks in each rental substratum. He was also instructed to 
go aroimd as many of the three blocks as necessary to contact respond¬ 
ents owning the particular piece of furniture or saying they were in the 
market for it. If the respondent did not meet this qualification, the 
interview was discontinued and contact was made with the next dwell¬ 
ing unit. Wherever the respondent owned such a piece of furniture or 
was in the market for one, the interview was completed. 

Various styles of furniture of different price range were judged by 
respondents (higher-priced pieces by respondents living in the higher- 
rent blocks, lower-priced pieces by respondents in the lower-rent 
blocks). On a few pieces of medium price, however, we measured 
preferences in all three groups for purposes of experimental control. 

VI 

A frequent problem for any research agency is the high expense of 
travel from interview to interview. The next example illustrates a 
problem in studying farmers’ opinions, one of the worst situations 
from the standpoint of interviewer travel cost. 

During war time, government regulations determined prices and 
conditions under which much farm produce could be sold by farmers 
to food manufacturers and processers. There was considerable resent- 
m^t on the part of the farmers against these regulations. There was 
also resentment against the manufacturers who in the opinion of some 
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farmers manipulated these regulations to their own advantage and to 
the farmer’s disadvantage. 

The purpose of the survey was to study farmers’ sentiments on this 
issue. It was decided to restrict the study to seven counties in a Mid¬ 
western state—an area where the client did most of his purchasing. 

Interviews were allotted to each coimty in proportion to number of 
farms, but in order to save cost, interviewing was limited to every 
other township. 

State highway maps were obtained on which square-mile sections 
within each sample township were numbered. 

The mnnber of sections within each sample town^p was recorded, 
and a cumulative total of these sections was obtained. Then the 
average number of farms per section was computed. In each county 
the desired mnnber of sections was obtidned by dividing the assigned 
mnnber of interviews by the average number of farms per section. In 
order to concentrate interviewing still more, two adjacent sections 
were combined into one double section. The sampling ratio was then 
obtained by dividing the total number of sections in the sample of 
townships by the number of double sections wanted. An ordered sample 
of every nth double section was selected. Interviewers were instructed 
to interview every farmer in the double sections selected, making call¬ 
backs wherever the farmer was not found at home. 

Note: This conceatratioa of interviews in double sections is, of course less 
efficient in a statistical sense than a sample of single sections or in¬ 
dividual farms. In the information-per-doUar sense, however, it is 
undoubtedly much more efficient. Moreover, it greatly facilitates call¬ 
backs. 


VII 

Frequently firms desire to make opinion studies among their custom¬ 
ers, or associations want to poU a cross section of their members. 
Often in such cases the client firm or association can supply a complete 
list of the universe to be studied. The sampling problem is then usually 
very easy; we stratify the names on the list as efiGiciently as posable 
and select every nth name. 

Shortly before the end of the war, the planning board of a metro¬ 
politan area was concerned with the potential post-war employment 
in the area. During war time the population of the area had increased 
very considerably, and it was important to make an estimate of the 
number of people for whom employment mi ght be provided in the 
years to come. The budget for the survey allowed interviews with 1,500 
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employers in the ai-ea. A complete set of cards shoTidng each firm, its 
industry group, and the numbers of its employees was made available 
by the client. In the area to be studied there were approximately 760 
firms having 50 employees or more, which accounted for about 64% 
of all mnployees in the region. It was decided to include all these 750 
firms in the sample. 

Of the remaining 15,000 firms having fewer than 50 employees every 
20 th one was selected. 

Thus, approximately equal numbers of firms having more and fewer 
than 50 employees were selected for the sample. By full coverage of 
firms that were more important in terms of emplo 3 unent, the statistical 
error in projecting the average number of employees was reduced 
considerably. 

Note: This example illustrates the common case in which certain segments 
of the population to be studied contribute such high variability to the 
final statistical estimate that it is efficient to sample disproportion¬ 
ately from different strata. This amounts to a very simple application 
of what is commonly called "Neyman sampling.” 

The cards for employers of 60 persons or less were stratified by de¬ 
tailed industry groups, and were arranged within each group in ascend¬ 
ing order from the smallest to the largest firms. Every 20th card of 
each industry group was then selected, carrying over the count from 
one industry group to the next. 


VIII 

The next survey, which uses quota sampling, illustrates a very im¬ 
portant and useful technique for studying many different and wide¬ 
spread populations at the same time. 

In this instance the purpose of the smrvey was to bring out pertinent 
facts bearing upon the public-relations program of a particular industry. 
A variety of issues was to be covered, and each issue was to be ex¬ 
plored thoroi^hly. 

The research problem was to describe in detail the attitudes of many 
different “publics” toward this industry; 

1. Manual worlceta employed in this particular industry. 

2. Besidents of each of the major areas (each was analyzed separately) in 
which the industry is concentrated. 

3. Separate cross sections of the general population in seven geographic 
re^ons. 

4. A national moss section of the continental United States. 
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Note: More and more in public-relations research it has been found useful 
to study the "concentric circles” of populations toward which the 
public-relations effort is being directed. This puts great premium on 
developing efficient sampling techniques for overlapping populations. 

In such cases ingenious and workable methods are needed to take 
fullest advantage of the additional efficiency gained through making 
each successive sample contribute to the sample of the next con¬ 
centric population. 

The national cross section was made up of interviews allotted to ap¬ 
proximately 450 localities in proportion to the poptilation of geographic 
regions, states, and of size-of-town groups. Each interviewer was as¬ 
signed quotas by sex, age, race, employment status and occupation. 
These quotas were based on 1940 Census data adjusted wherever pos¬ 
able to more recent Government estimates. 

In four of the seven geographic regions, the national cross section 
sample was adequate. In the other three, additional interviews were 
taken to supplement those obtained in the national cross section. These 
extra interviews were assigned to the sample localities in proportion to 
the distribution of the general public in these regions. 

Major industrial areas were defined as all coimties m which a mini¬ 
mum of 3% of all employed men were engaged in the clients in¬ 
dustry. These counties were stratified according to population and 
according to per cent of men employed in the industry. These substrata 
were set up so that the aggregate populations of the counties in each 
were approximately equal. A sample of counties was then drawn at 
random from each stratum. General-public interviews in the national 
or re^onal samples which were taken in counties where more than 3% 
of the employed men were engaged in this industry were included in 
the sample of major industrial areas. Consequently, the number of 
interviews allocated to comities in each stratum was reduced by the 
number of interviews already taken in this stratum in other samples. 
Since there was considerable overlap between the different areas ansr 
lyzed, very substantial savings were effected by taking advantage of 
tile duplications. Quotas were assigned by sex, age, race, employment 
status, and occupation in proportion to the distribution of such 
characteristics in each major industrial area. 

Interviews with industry employees were taken in the sample 
coimties in the major industrial areas in proportion to the emplosrment 
in the industry in those areas. The remainder of the industry employee 
sample was allocated among the national sample localities in propcn:- 
tion to the employment in the industry in those localities. 
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Note: Quota sampling has been widely condemned in recent years. Prin¬ 
cipally, its validity can be questioned on the underlying inverse logic 
on which it is based—^namely: 

1. A representative cross section will reflect known characteristics 
accurately; 

2. Therefore, a sample which reflects known characteristics accurately 
is a representative cross section. 

In spite of this dubious logic, it can be shown that under certain not 
uncommon conditions quota sampling may have the highest cost 
efficiency of any method now known. 

IX 

A very interesting and ingenious application of an older technique, 
which has been largely abandoned in the past ten years, has recently 
been reported by Benson, Young, and Syze.^ 

This technique, now known as pinpoint sampling, was used by the 
American Institute of Public Opinion in connection with the 1944 
presidential election. 

Past election results for five particular states were found to represent 
a remarkable reflection of national election outcomes for all two-party 
contests since 1916. 

Within each one of these five states, eight precincts were selected for 
the sample. The precincts within each state were chosen in such a way 
that, ^^giving equal weight to each, the average of the percentages of 
the vote of either major party was virtually the same as the election 
result for the state.” This was found to have been true in three previous 
statewide elections. 

Within the precincts selected interviewers were instructed to inter¬ 
view in every—or every other—dwelling unit, depending on the num¬ 
ber of voters in the precincts. The interviewers still retained the free¬ 
dom of selecting respondents within the family group, except for the 
limitation of obtaining over-all quotas for sex and age. 

Interviews were completed in about one half of the households in the 
sample election districts. Slightly more than one fourth of the potential 
voters were interviewed. 

The results show a deviation of 1.9% from the actual national vote. 
Part of this deviation was due to the selection of states and part to the 
selection of precincts. Benson, Young, and Syze concluded from the 


^ Edward G- Bensoxi, C^yrus C. Voung, and Clyde A. SyzCp ^Polling Lessons frozn 1944 Elec* 
tion.* TAe PuWm Opinion Quartedy, Winter 1946-46. The authors are indebted to Messrs. Benson, 
Young, and Syse for their permission to inresmit their 
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results that improvements could most profitably be made by enlarging 
the number of sample areas and taking a smaller number of interviews 
within each area. 

We have here a case of sampling by purposive selection. 

Note: The use of the method of purposive selection has been roundly con¬ 
demned by many writers on the subject of sampling. Gini and Galvani 
in an article in Annali Di Statistica in 1929 showed empirically that 
this method was very inefficient. Later Neyman in his now famous 
article, “On the Two Different Aspects of the Representative 
Method,” The Journal of ike Royal Statistical Society, 1934, confirmed 
theoretically Gini*s and Galvani’s findings. 

Neyman, however, did express the opinion that under certain very 
rigid conditions purposive selection might be an efficient method of 
sampling. One of those very rigid conditions was that the technique 
might “work” when the controls were very highly and linearly cor¬ 
related with the statistics to be measured. Although Benson, Young, 
and Syze do not discuss this point, it appears that this condition has 
been met. Therefore, it seems reasonable that the method of pinpoint 
sampling may have very important possibilities in predicting election 
results. 

X 

Although the problems already presented indicate the great variety of 
situations a commercial research agency has to meet^ there are many 
more entirely different sampling and interviewing situations that 
cannot be covered in detail here. 

Some of the sampling techniques described vary according to the 
particular survey situation. In a survey taken in a town and in its 
surrounding rural area, for neither of which block statistics were avail¬ 
able, the sample was designed by drawing routes on a town map and 
state highway maps. The routes were drawn in such a way as to include 
different sections in the town as well as the different types of roads in 
the survey area. Interviewers were to interview in every nth house on 
the right side of the route designated until their quota was completed. 
Different starting points were chosen for the routes, some leading from 
the town towards outlying areas, others starting as far as 15 miles from 
the town and leading toward it. 

Another example of a special dtuation: Even though interviews are 
generally done personally with only one respondent in any one house¬ 
hold, there are cases where it is best to interview husband and wife to¬ 
gether in order to approximate a family bu 3 dng situation (for example, 
when a survey is made on intended purchases of refrigerators). 

Even though most questioning is done by personal interview, there 
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are situations where telephone interviews are satisfactory (for example, 
a survey with a short and strictly factual questionnaire and a special¬ 
ized universe all members of which are telephone subscribers). 

In other cases, such as in “in-plant studies,” self-administered group 
interviews are used with a sample including every nth employee from 
the payroll. 

In situations where respondents might feel that their statements 
could place them in some form of jeopardy, it may be necessary to in¬ 
terview on the street to allay any possible suspicion that respondents 
might be traced. 

In cases of a management sample, such as sales managers or public- 
relations directors of companies with assets of more than $6,000,000, 
it may be necessary to sample from such sources as Moody^s and 
Standard and Poor, and from individual state directories, supplemented 
by whatever information is available through local chambers of com¬ 
merce. 

In a survey with stockholders, where the composition of the universe 
was unknown, information was obtained on the characteristics of the 
types of people who are most often owners of stocks by including a 
question on stock ownership in other large national surveys. The 
survey with stockholders could then be restricted to these types, with 
the realization, of course, that the unusual types of stockholders would 
not be included. 

There are numerous other situations—^radio surveys, magazine sur¬ 
veys, advertising studies, and many others—^which require various 
approaches suited to their special needs. 

Looking back at the examples given, we realize that there are many 
other possible sampling approaches to these problems. The purpose of 
this paper is to report, not to analyze. These were the approaches that 
were actually used. They are presented here to illustrate a few of the 
many commercial uses of sampling. 



VARIATION OF THE FREQUENCY OF FATAL 
QUARRELS WITH MAGNITUDE 

Lewis F. Richabdson 
Hillside House, Ktlmun, Argyll, Britain 

A record of wars during the interval a.d. 1820 to 1945 has 
been collected from the whole world, and has been classified 
according to the number of war-dead. The smaller incidents 
have been the more frequent, according to a fairly regular 
graph which can be extended to quarrels that caused a single 
death. 


INTRODtrCTION 

A NYONE WHO TRIES to make a list of “all the wars” (e.g. Wright [18] 
p. 636) encoimters the diflBLculty that there are so many small in¬ 
cidents, that some rule has to be made to exclude them. 

From the psychological point of view a war, a riot, and a murder, 
though dijSering in many important aspects, social, legal, and ethical, 
have at least this in common that they are all manifestations of the 
instinct of aggressiveness. This Freudian thesis has been developed by 
Glover [5], by Durbin & Bowlby [3], and by Harding [6]. There is thus 
a psychological justification for looldng to see whether there is any sta¬ 
tistical connection between war, riot, and murder. I dealt with this 
problem in 1941 by forming the inclusive concept of a “fatal quarrel,” 
and then classifying the fatal quarrels according to the number of 
quarrel-dead [12]. By b. fatal quarrel is meant any quarrel which caused 
death to humans. The term thus includes murders, banditries, mutinies, 
insurrections, and wars small and large; but it excludes accidents, and 
calamities such as earthquakes and tornadoes. Deaths by famine and 
disease are included if they were immediate results of the quarrel, but 
not otherwise. In puzzling cases the legal criterion of “malice afore¬ 
thought” was taken as a guide. 

The record of the number kiUed in a particular war is often uncer¬ 
tain by a factor of two. The meaningful part of the record can be sepa¬ 
rated from its uncertainty by taking the logarithm to the base ten and 
roimding the logarithm oS to a whole number, or to the first decimal, 
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according to the quality of the information. For simplicity I have 
lumped together the deaths on the opposing sides of the quarrel. The 
magnitude of a fatal quarrel is defined to be the logarithm to the base 
ten of the number of people who died because of that quarrel. The 
magnitude will be denoted by fi. The range of magnitude esctends from 
0 for a murder involving only one death, to 7.4 for the Second World 
War. Other well-known wars had magnitudes as follows: 1899-1902 
British versus Boers 4.4; 1939-40 Russians versus Finns 4.83; 1861- 
65 North American Civil War 5.8. The magnitude of a war is usually 
known to within +0.2; so that a classification by unit ranges of magni¬ 
tude is meaningful. These ranges have been marked off at 7.5, 6.5, 5.5, 
4.5,3.5,2.5,1.5 and perhaps at 0.5 and —0.5. Every fatal quarrel must 
lie inside one of these cells; it cannot lie on a boundary, because the 
antilogs of the boundary n are not integers. 

This abstract framework becomes of interest when the facts are sorted 
into it. As far as I have been able to ascertain, the historians have 
never sorted their facts by the scale of magnitude. The chief obstacle 
to any scientific study of wars-and-how-to-avoid-them, is each nation’s 
habit of blaming other nations. National prejudice can however be 
avoided by taking the whole world as the field of study; and by taking 
a time-mterval longer than personal memory. For this reason I have 
made a search for the records of fatal quarrels in the whole world since 
the beginning of a.d. 1820. The task has been long; and the results 
are here presented in brief summaiy. For magnitudes greater than 2.5, 
the facts were mostly obtained from works on history. For magnitudes 
less than 0.5 they were taken from criminal statistics. For magnitudes 
between 0.5 and 2.5 the information is scrappy and unorganized; what 
there is of it suggests that such small fatal quarrels were too numerous 
and too insignificant to be systematically recorded as history, and yet 
too large and too political to be recorded as crime. 

It is important to know whether the facts have all been gathered. 
The best evidence is provided by the progress of search. I began in the 
year 1940. At first my collection grew rapidly, then slowly; then various 
revisions caused its totals to oscillate slightly. The collection appeared 
to be sufficiently complete to warrant the publication of a summary, 
which appeared in Nature of 15th November, 1941. Afterwards the 
publication of Quincy Wright’s list of wars ([18] Appendix XX) pro¬ 
vided a stimulus to further enquiry, which involved the consultation 
of some seventy history-books. In the following table three stages of my 
records are compared. All refer to fatal quarrels which ended from a.d. 
1820 to 1929 inclusive. 
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Date and place of record 

Ends of range of magnitude 

7±i 

6±| 

5±J 

4±| 


Numbers of fatal quarrels recorded 

1941. J^otureof Noy.l5 

1 

3 

16 

62 

1944. J.R.S.S. 107,248 

— 

— 

— 

63 

1947. Here. 

1 

3 

20 

60 


It is seen that there has been a sort of convergence, which makes the 
present numbers worthy to be discussed. That is for /t >3.6. For smaller 
incidents in the range 2.6 </( <3.5 my collection has continued to grow; 
the sign ^ is therefore prefixed to its totals. 

The world-total for the murders was obtained from the murder-rates 
of 17 countries. They were weighted by populations. The weighted 
mean rate for the world was found to be 32 murders per million per 
year. This is not as definite as the statistics of wars, because the crimi¬ 
nal statistics of China, Africa, and South America (except Chile) were 
not available. But even if that estimate were as much as three times 
wrong, which is incredible, it still would give significant results in Fig¬ 
ures 0 and 1, because they relate to such enormous ranges. The evidence 
which I had concerning time-changes in the murder-rate was scrappy 
and confiicting; so that provisionally I took the rate to be constant. 
The mean population of the world over the 126 years from a.d. 1820 to 
1945 was computed from a publication by Carr-Saunders [lA] and was 
found to be 1.49X10*. So the world-total of murders was computed as 
1.49 X10* X32 X10-® X126=6.0 X 10«. 

A revision based on the murder-rates of 21 countries, gave a mean 
rate of 37 instead of 32, and a world-total of 7 instead of 6 million. The 
following diagrams and tables were based on the earlier estimate and 
have not been altered. 

A CONSPECTUS WHICH SUITS THE WABS 

The number of fatal quarrels has been counted, or estimated, in unit 
ranges of megnitude, with the following results for the world as a whole. 
The date of a quarrel is taken to be that of the termination of hostilities. 
The interval is A.n. 1820 to 1945 inclusive. 


Ends of range of magnitude 

r±i 6±* 6±i 

4±i 

3 ± § • • • Murders 

Observed number of fatal quarrels 

2 5 24 

63 

^188-•• 6X10* 


It is desirable to show all these facts together on a single diagram. 
If the number of fatal quarrels per unit range of magnitude were taken 
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as the ordinate, then either the wars would show, or the murders, but 
not both. When however the logarithm of the number of fatal quarrels 
per unit range of magnitude is taken as the ordinate, then all parts 
show equally well. Each murder presumably involved a small number 
of deati^, such as 1, 2, 3, or 4, so that the corresponding logarithms 
were 0.000, 0.301, 0.477, or 0.602. How to group them was not obvious. 
As a first expedient 1 extended to the murders the system of classifiica- 
tion which had been found to suit the wars, namely by unit raises of 
magnitude cut at 7.5, 6.5, 5.5, 4.5, 3.5, and so on by extension to 0.5 
and —0.5. Murders involving 4 or more deaths were regarded as neg¬ 
ligibly rare. The ordinate / is defined to be 


/ = logio 


( number of fatal quarrels 1 
(corresponding range of magnitude/ 


This rough and ready scheme gives the conspectus shown in Figure 0* 
It is the type of diagram with which the author began, and it has now 
been asked for by a referee. A dotted curve has been drawn across the 
gap to suggest that there is statistical continuity between the wars 
and the murders. 


THE FAtrLTS OF FI6UBE 0 AND HOW TO AMEND THEM 

In order to match the wars, the murders have been enclosed in a unit 
range of magnitude, as shown at A in Figure 0. But the half of this range, 
from n= —0.5 to ii just less than zero, is necessarily empty. Would it 
be more reasonable to regard the murders as enclosed in a half-unit 
range? If so the line representing them in Figure 0 would be halved in 
length. Also the number of murders per unit of magnitude would be 
doubled. So the representative line would be raised by logio 2 into posi¬ 
tion marked B. Worse ambiguities occur if we wish to show separately 
the number of murders each causing one death. Let there be M of them. 
To what range of magnitude ought they to be attributed? The range 
must contain the point n=Q, and must not contain the point ju=logu 2, 
but otherwise is unspecified. Yet M has to be divided by this indefinite 
range in order that the reckonii^ may be “per unit raige of magnitude”. 
Figure 0, though suitable for the wars, is too vague for quarrels that 
caused only one death. 

It m^ be thou^t that the root of the trouble is the reckoning “per 
unit lai^e of” anything, and that if ranges were abandoned then mur¬ 
ders and wars could be compared by plotting the logarithm of their 
numb^ against tlidr magnitudes. Such a diagram would consist of 
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points, not segments. There are two objections to this proposal: (i) 
the magnitudes of wars are imperfectly known and some of them are 
rounded off to whole numbers; so that there would be many artificially 
coincident points, (ii) if the magnitudes were better known they would 
presumably be scattered, so that a point-diagram of wars would pre¬ 
sent great irregularities, of no interest. For when n exceeds 1000 there 
is never likely to be any statistically significant distinction between 
wars involving n and n-fl deaths. In bii^ we must have points for the 
murders and averages over ranges for the wars, and yet these diverse 
representations must somehow be compared. The solving idea, which 
melted all the obstacles, was that the appropriate measure of any range 
of quarrel-dead is simply the number of integers in that range, and not 



528 


AMEEICAN STATISTICAL ASSOCIATION JOUBNAL DECEMBEB 1948 


the corresponding range of magnitude, nor any other artificial concep¬ 
tion. 

A DIAGRAM WHICH SUITS THE MURDERS 

The logically simple idea is that there were, say q{n) (1) 
quarrels each involving n deaths. 

For example in England and Wales during 1935 and 1936 there were 
known to the police 186 cases of murder of 213 persons aged over one 
year. In 74 cases (involving 91 persons) the murderer or suspect com¬ 
mitted suicide. In 105 cases (115 victims) 111 persons were arrested. 
In the remaining 7 cases, involving 7 victims, no arrest was made. Of 
the 111 persons arrested 16 were executed (Whitaker's Almanacks 
1938 and 1939, p. 652). These statements can be interpreted, with 
only slight uncertainty, by the following distribution. 

totals 

n= 1 2 3 

g(n)=92 71 23 186 

ng(n)=92 142 69 303 

The number of deaths, n, connected with the quarrel is here taken to 
include, along with those of the victims, also those of the murderer or 
suspect, whether by suicide or execution. It seems permissible, for il¬ 
lustration imtil better information can be found, to regard the 6 million 
murders in the world from 1820 to 1945 as the number of fatal quarrels 
for n = 1, 2 and 3, and to subdivide this total among its three parts in 
the same ratios as the total for England and Wales is subdivided. Thus 
one finds for the world, the following enlarged values of g(n). To take 
one country as a sample of the world is very unsatisfactory. But it 
should be noted that the uncertainties of these q{n) are insignificant in 
comparison with the vast range of Figure 1 where q{n) varies in a ratio 
exceeding 10“ 

totals 

71 = 1 2 3 

g(n)=3.0 2.3 0.7 6.0lmillions 

7ig(7i)=3.0 4.6 2.1 9.7) 

These give three points in the top left comer of Fig. 1, with the coor¬ 
dinates 

logiow=0 0.301 0.477 

logio g(7^)=6.48 6.36 5.85 

A GENERAL AND WELL DEFINED CONSPECTUS 

This will now be obtained by extending to the wars the method intro¬ 
duced above for the murders. 
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In the region of wars, say for n>1000, the function q{n) is usually 
zero and occasionally unity; so that a diagram of it would be bristly 
and unreadable. Therefore for the wars we should take an average of 
q{n), say g(«). As a preliminary to the definition of J(«), it must be 
imderstood that the fatal quarrels are arranged in order of n. The defi¬ 
nition of q{n) is then 


q{n) = 


number of fatal quarrels in a range of n 
number of integers in that range of n 


( 2 ) 


Briefly, and somewhat inaccurately, g(n) may be called the “average 
frequency of quarrels per unit range of quarrel-dead.” As in all fre¬ 
quency-diagrams so here the choice of the group-range has to be a 
compromise: if the range is too small, the diagram is spiky; if the range 
is too large, essential features are flattened out. The question of the best 
range must be decided by trial. 

The diagram of q and q as functions of n is the logically simple con¬ 
spectus of the frequency of wars and murders. This diagram cannot 
be drawn, because no sheet of paper is large enough to show the facts 
at both ends of the range. The difficulty is overcome by plotting the 
logarithms of g, q and n. The magnitude has already been defined 
by 

M = logio n. (3) 


Another oft-recurring symbol, must now be defined. 

Let logic 2 (w)say, for the wars: while for the murders 

0(m) = logic without averaging. (4) 

The last column of the following table shows apart from some 
corrections for grouping, which will be explained on p. 542. The first 


Range of 
magnitude 

Number of integers 
in range of 
war-dead 

Observed number 
of fatal quarrels 
Years 1820>1946 

logtft (number of 
quarrels per unit 
range of war-dead) 

2.5to3.5 

2,846 

^188 

^-1.180 

3.5 to 4.5 

28,460 

63 

-2.665 

4.5 to 5.5 

284,605 

24 

-4.074 

6.5 to 6.5 

2,846,050 

5 

-5.765 

6.5 to 7.5 

28,460,499 

2 

-7.153 


and last columns of this table provide the abscissa and ordinate of the 
accompanying diagram (Figure 1), as far as the wars are concerned. It 
is evident that the wars can be satisfactorily fitted by a straight sloping 
line. On account of grouping, the wars are represented, not by points 
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as the murders are, but by horizontal segments. It mil be shown 
later (p. 542) that the^Cu) graph should pass slightly above the mid- 
pdnts of these segments. That is to say the sloping straight line was 
actually obtained by a second approximation. 

The relaMon between the ordinates of Figures 0 and 1 
The abscissa ju is the same in both diagrams and the ordinates are 
respectively/and (j>, defined thus 

. , + Q(ni + 1) + 2(«i + 2) + • • • + q(ni + s) 

f = logio <-^5- 

1. logio nt — logio«« 


where 7ii—l <»«<«!, and niH-s<»i<»i+s+l. These extra bits at the 
ends of the range have to be put in to prevent / becoming infinite at 
8 =» 0 . 

, , (2(«i) + q(ni + 1) + 3(ni + 2) H-+ q(ni + s)) 

♦ - ’‘®* i- TTi -1 

which is definite at s^O. By subtraction 


/ — ^ = logic 


{ 


« + l 


logic nh — logic 


n«} ’ 


This expression involves an ambiguity, because «c is loose in a range of 
unity, and so is n*. The practical question however is whether the am¬ 
biguity is greater than the imcertainty of the observations. It is neces¬ 
sary to distinguish different cases. 

Case (i). The range here used in treating the wars is/to— 
where logic n and where /to ^3. In these drcumstanoes arithmetic 
^ows that 

/-^ = MO+ 0.454, (5) 

reliable to the third decimal. Seeing that the m of a particular war is 
often uncertain by +0.1, equation (5) is admirably accurate for the 
purpose. 

Case (ii). We may conceivably wish to consider a shorter range 
Mo—6^M^Mo+c where mo is still in the region of the wars. If o is small, 
but not too small, there is a quasi-limit such that 


/ ~ ^ Mo + 0.362. (5A) 

This was obtained by approximating to the difference-ratio by a deriva¬ 
tive, and was confirmed by arithmetic. 




ft — LOO,, (number of quarrel-dead) 


EiaUBEi 1. TBE WaOUB BANCffl OP FATAL QtTABBELS 

THE HORIZONTAL SCALE EXTENDS FROM A SINGL E PER SON ON THE LEFT TO 
THE POPULATION OP THE WORLD ON THE BIGHT. THE THBEE DOTS AT THE LEFT 
HAND TOP BEPBESENT THE MURDERS. THE SHORT HORIZONTAL LINES REPRESENT 
THE WARS GROUPED BY MAGNITUDES. ONE OP THESE LINES IB DOTTED, BECAUSE 
IT SHOULD BE T>.ATgian TO AN UNKNOWN HEIGHT. 
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Case (iii). If there are only a few integers n in the range then the 
ambiguity becomes troublesome. This is the situation as to the muiv 
ders. The ambiguity of f—<l> is more than that of some criminal statis¬ 
tics. The fault lies in /, not in When «i=1 and s=0 it is hardly pos¬ 
sible to locate ria and rii by any reasonable convention, as already ex¬ 
plained on page 526. To revert to Cases (i) and (ii), either of them give 
the same relation between the slopes of the/- and ^-diagrams. As these 
slopes are in fact negative, it is convenient to write the relation thus 

— d4>/diia = — df/dnt + 1 (5B) 

which shows that the ^-diagram is the steeper. 

The slope in the region of the wars 

This wasfound, by the principle of Maximum Likelihood, [4 & 10] to be 
d<t>/diio = — 1.50. (5C) 

InterpolaMon across ffie gap between the wars and the murders 

The dotted horizontal segment for the small wars represents only a 
lower bound, and otherwise should be ignored. Between the definite 
end-points of the gap, the mean slope is 

d4>/dix = - 2.38. (6) 

This is shown by a straight line of dashes. 

As to the choice of a curve, there are the general clues of simplicity 
and of continuous slope. For simplicity a circular arc was chosen; and, 
for continuous slope, the arc was made tangent to the strai^t line 
which fits the wars. It was then found, as an independent confirmation 
that the slope of the circular arc joins on continuously to that for the 
murders. Some collateral evidence is provided by banditry. 

A SEARCH FOB THE FACTS ABOUT QUARRELS THAT CAUSED 
FROM 4 TO 315 DEATHS (0.5</*<2.5) 

That there were many small fatal quarrels is shown by a statement 
made by Quincy Wright [18] in the introduction to his long list of "Wars 
of Modem Civilization.” He remarked that “A list of all revolutions, 
insurrections, interventions, punitive expeditions, pacifications, and 
explorations involving the use of armed force would probably be more 
than ten times as long as the present list,...” Wright however did 
not classify fatal quarrels according to the number of quarrel-dead. 
Similarly whilst reading histories and police-reports and listening to 
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radio-news I have noticed allusions to very many incidents less impor¬ 
tant than wars, but more important than murders. References f2], 17], 
[11], [15], contain some remarkable examples. However 1 see no hope 
of obtaining world-totals of the numbers of such incidents during the 
126 years to which Figure 1 relates, and classified moreover by m£^ni- 
tudes in the range 0.5 </t <2.5. Any factual test of the interpolated por¬ 
tion of Figure 1 will have to depend on smaller samples. The record of 
banditry in Manchoukuo is comparable on certain assumptions, and 
that of gangsters in Chicago on further assumptions. These ordered 
collections of facts, being rare specimens, are correspondingly valuable. 

BANDITBT IN MANCHOTTKOO DUBING THE YEAB 1935 

This is of especial interest, because the facts are given in a form 
which throws light on aggregation for aggression. The following is 
quoted from the Japan and Manchoukuo Year Book, 1938, pp. 692-95. 

"At the time of the foimding of Manchoukuo in March of 1932 the 
total number of bandits exceeded 100,000. By September of the same 
year the number had increased to 210,000 due principally to the sub¬ 
versive activities of Chang Hsueh-Liang’s remnant troops who were 
thrown out of employment following the downfall of the young mar¬ 
shal. Since then, however the number of such bandits has been on the 
decrease as a result of their suppression by Manchoukuo and Japanese 
forces. 

"Compared with the condition obtainiog in 1932 two factors loom in 
prominence with regard to the bandit situation. Firstly may be noted 
the actual reduction of bandits as a whole, and, secondly, the shrinkage 
in size of bandit groups. In 1932 some bandit groups had an actual 
fighting force of 30,000 men, but at present the average is below 50 
bandits per group. The chief cause for the existence of bandits in 
Manchoukuo is believed to be an economic one, resulting from unem¬ 
ployment.” 

STATISTICS OF BANDITBY 


Size of bandit groups 


ItoSO 

31 to 60 

51 to 100 1 101to200~ 1 201 to 300 1 301 to 500 

>501 



Coxresponding number of raids 


28.145 

4,784 

3,864 1 1,630 1 4S5 1 240 

130 


The first step towards making a comparison with larger fatal quar¬ 
rels is to compmisate for the imequal ranges of size in the above quota¬ 
tion, by dividing each number of raids by the corresponding range of 
size. For example 4784/(50—31-hl)=239.2. Any such ratio may be 
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called the “number of raids per unit range of membership.* For the 
first six groups it runs as follows 

938.17 239.2 77.28 15.30 4.55 1.20 

and shows plainly that the smaller incidents were much the more frequent. 

Was a raid like a small battle or like a small war? In other words did 
the same group of bandits perpetrate many raids? This can be an¬ 
swered by attending to totals. The total number of appearances of ban¬ 
dits can be underestimated from the above table by multiplying the 
least size in each column by the corresponding number of raids, and 
comes to 758,371 or more. A moderate estimate is 1.3 million. 

But the total number of bandits is stated to have been much less 
than 210,000. It is evident therefore that on the average the same 
bandit appeared in several raids. That is to say a raid resembled a bat¬ 
tle rather ffum a wai . 

In the same year and same region the numbers killed in connection 
with banditry are stated thus: bandits 13,338, suppression troops 1361, 
civilians 2512. The total dead was therefore 17,211. This is 1.3 per cent 
of the total number of appearances of bandits. 

The facts about banditry are plotted in Figure 2. This is not a 
diagram, yet it is rather like one. For the abscissa, v say, is logio of the 
number of bandits in the group, whereas n is logio of the number of 
quarrel-dead. Again the ordinate, ^ say, is logio of the average num¬ 
ber of raids per unit range of membership, whereas ^ is logio of the av¬ 
erage number of fatal quarrels per unit range of quarrel-dead. The 
first group in the data has a membership range of from 1 to 30, which 
is in a ratio too great for present purposes. Although the first group is 
shown on Figure 2, it will be ignored in the discussion. The blunted top 
of Figure 1 has however some resemblance to the horizontal in Figure 
2. The other groups are well fitted by a straight line having the slope 

dil>{v)/dv = - 2.29. (7) 

This has a remarkable resemblance to the mean slope, —2.38, across 
the gap in the world-diagram, as stated in (6). Can this agreement of 
the slopes be a mere coincidence? Or is it a clue to a general law con¬ 
cerning aggregation for aggression? It is at any rate easily explained 
by the following two assumptions:— 

Firstly let us suppose that the quarrel-dead were on the average pro¬ 
portional to the number of bandits in the group. That is to say that 
groups continued to make raids until a constant fraction of their mem¬ 
bers had been killed. The supposition is that 

n=AX (number of bandits in group) 
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o 

O 

LOG,^(number of bandits IN QROUPj 

FIQTTKE! 2 

where A is independent of the size of the group. Accordingly 

5= jr + logio A. (8) 

It may not be necessary thus to suppose that the percentage of killed 
was exactly the same for every group, but only to suppose that it had 
no correlation with the size of the group. 
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Secondly let us suppose that the number of raids which a group of 
bandits made, as part of their quarrel with the rest of the community, 
was not correlated with the size of the group. More precisely the sup¬ 
position is that: 

(number of raids made by a group) = Bq{n) 

where B is independent of n, so that 

= ^(ju) + logio B. (9) 

According to these assumptions the if'(j') graph, when slid parallel to 
itself, horizontally through log A and vertically through log B, becomes 
a local <l>(tt) graph. The remarkable agreement of the slopes can there¬ 
fore be interpreted as meaning that one of the characteristics of aggre¬ 
gation for aggression was the same in Manchoukuo as it was in the 
world-total. 

That broad agreement is interesting; but there remain some doubts 
or discrepancies in detail. The Manchurian data do not fix A and B, 
though A can be roughly estimated. For it has been shown that on the 
average 1.3 per cent of the bandits who appeared in a raid were killed, 
and that a group on the average perpetrated several raids. The total 
quarrel-dead, n, may have been about 10 per cent of the number of 
bandits in the group; that is to say A =0.1, so that j«=i'—1. The mean 
value of M on the straight part of the bandit-diagram is accordingly 
about one. If so, there is a discrepancy with the circular arc on the 
world-diagram. For at ju=l the slope of the arc is decidedly steeper, 
namely d<t>/dit=--d.l. The Manchurian data therefore suggest that 
the gap in the world-diagram (Figure 1) should be closed, not by the 
circular arc, but by the straight segment of slope —2.38 leaving dis¬ 
continuities of slope at its two ends. 1 regard that as a suggestion to be 
remembered, but not to be acted on without further evidence. The shift 
/*=»—! cuts off most of the horizontal part of Figure 2 and thus in¬ 
creases its resemblance to the top of Figure 1. 

OANOINO IN CHICAGO 

F. M. Thrasher [16] made a study of 1313 gangs in Chicago with a 
view to their re-direction, or to some other social improvement. 

He defined a gang in these words (p. 57): "The gang is an interstitial 
group originally formed spontaneously, and then integrated throu^ 
conflict. It is characterized by the following types of behaviour: meet¬ 
ing face to face, milling, movement through space as a unit, conflict, 
and planning. The result of this collective behavior is the development 
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of tradition, unreflective internal structure, esprit de corps, solidarity, 
morale, group awareness, and attachment to a local territory.” 

Thrasher gives (p. 319) a table showing the approximate numbers of 
members in 895 gangs. The first two columns of the following table are 
copied from Thrasher, the third column is a deduction, designed to 
smooth out the disparities of the given ranges. The number of members 


Number of members 
in gang 

Number of 
such gangs 

Gangs per unit range 
of membership 

inclusively 



3to 

5 

37 

12.3 

6 to 

10 

198 

39.6 

11 to 

15 

191 

38.2 

16 to 

20 

149 

29.8 

21 to 

25 

79 

15.8 

26 to 

30 

46 

9.2 

31 to 

40 

55 

5.5 

41 to 

50 

51 

5.1 

51 to 

75 

26 

1.04 

76 to 

100 

25 

1.00 

101 to 

200 

25 

0.25 

201 to 

500 

11 

0.37 

501 to 2 

,000 

2 

0.0013 


in a gang is the same sort of quantity as the number of bandits in a 
group; so the same s3nnbol v is here used for logu of the first column. 
Logio of the third colunm is here denoted by x(v) to distinguish it from 
because the number of gangs is not similar to the number of raids, 
though related to it. Figure 3 shows a graph of xC**)- Apart from gangs 
of 15 or less, which show again the blunted top, the xW graph is well 
fitted by the straight line shown, which has a slope 

dxiv)/dv = - 2.30. (10) 

It is remarkable that this slope is almost the same as d4'(v)/dp= —2.29 
for banditry in Manchoukuo. This agreement strengthens the suspicion 
that some fairly general tendency concemii^ aggregation for agression 
is revealed by these otherwise scattered phenomena. Hypothetical ex¬ 
planation is again easy. The distinction between the number of gangs 
and the number of raids would not affect the slope, if it were true that 
the number, C say, of raids made by a gang was independent of the size 
of the gang. For then 

l^C*-) = %(>')+logioC (11) 

and the constant logic C would disappear on taking d/dv. Yet several 
features remain unclear. The Chicago gangs were certainly a^ressive. 
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FIGURE 3 


Thrasher has a chapter headed "Gang Warfare’’ and beginning with 
these words 

"The gang is a conflict group. It develops through strife and thrives on war- 
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fare. The members of a gang will fight each other. They will even fight for a 
‘cause.’. . . Gangsters are impelled, in a way, to fight; so much of their ac¬ 
tivity is outside the law that fighting is the only means of avenging injuries 
and maintaining the code.” 

Thrasher mentioned a few gang-fights which went as far as homicide, 
but he did not give any comprehensive statistics of the quarrel-dead. I 
am indebted to Prof. Lundberg and Margaret Black Richardson, for 
the indication of Thrasher’s work. 

SUMMARY 

The 4>(m) diagram in Figure 1 is the best conspectus of the facts for 
the world as a whole. Alternative tracks are shown across the gap 
where world-totals are lacking. Samples from Manchoukuo and Chi¬ 
cago confirm the slope of the straight alternative, whereas continuity 
of slope is a consideration in favor of the circular arc. 

MORE EXACT CONNECTION BETWEEN THE <f>(/jl) GRAPH OF FIGURE 1 
AND THE NUMBERS OF FATAL QUARRELS IN UNIT RANGES 
OF MAGNITUDE. INTERPOLATION ACROSS THE GAP. 

We need to connect the mean over a range with the value at its mid¬ 
point. It might be thought that the connection was obvious; for a glance 
at Figure 1 shows that the graph is fairly straight; and of course the 
mean ordinate of a straight segment is simply the ordinate at its mid¬ 
point. But that would be a delusion based on a doubly wrong type of 
mean. For here the mean of g, not of the ordinate logjo q, has to be taken 
over the integers in the range of r, and not over the abscissa = logio 
It will be shoAvn that the logarithmic transformation of both coor¬ 
dinates introduces correcting factors which depend on the slope, vanish¬ 
ing when it is cither 0 or —2, but otherwise often considerable. They 
arc obtained from the simplest formula that can fit the facts tolerably, 
namely the formula for a straight line on the 0(iu) graph. The straight 
lines are suitably short segments; for it would be a mistake to smooth 
away significant detail by the over-wide sweep of any too-simple 
formula. In particular the sharp bend near the top of Figure 1 comes 
from British criminal statistics where it appears to be significant; and 
something rather like it is to be seen in the graphs which represent 
banditry in Manchoukuo and ganging in Chicago; ro this Bend should 
certainly not be smoothed away. But elsewhere a short length of the 
graph can be represented by 


4>(jui) = B — Cfi 
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where B and C are positive constants. On taking antilogarithms this 
becomes 

q — 10®n~® 

an inverse power law. To find q this n”® has to be averaged over a range 
of n. As « proceeds only by integers, the average should be made by 
sums, not by integrals. However the integral fnr‘dn gives a useful ap¬ 
proximation to the sum. 

This programme will now be worked out in proper detail. 

The notation is that already defined in connection with the graph. 
The typical range ends, say, at where j«o=7, 6, 5, 4, 3, 2, 1. 

It is convenient to begin with a rough and ready approximation, B, to 
the number Q of fatal quarrels in the range, whereby g(n) at the mid¬ 
point is multiplied by the number of integers in the range of quarrel- 
dead, which it is convenient for a later purpose, to denote by 
where ^ sums for all n in the range of fi. Accordingly " 

n 

R = 2 1 . ( 12 ) 

n 

For example, as read from the circular arc on Figure 1: 


Ranfieof 

magnitude 

M =»logip» 

Ordinate at 
midpoint 

at midpoint 

Number of 
integers in the 
range of quarrel- 

dead 21 

» 

First approidxna- 
tion to number 
of quarrels 

R 

0.5 to 1.5 

3.87 

7410 

28 

207,000 

1.5to2.5 

1.23 

17.0 

285 

4,850 

2.5to3.5 

1.17 

0.131 

2846 

373 


The above easy method is however suspect, because ^(n) vaiies in the 
ratio 1740 in one of the ranges thus represented by its midpoint. So we 
must consider corrections. 

CorrecMon for toidth of range 

As the graph is nearly or quite straight, let us represent a local 
portion of it by 

^0*) = <I>M — c(ji — no) (13) 

where no is the midpoint of the range and c— —d^/dn there. The dis¬ 
tinction between q(,n) and $(n) is one that concerns detailed observa¬ 
tions. It does not arise when, as now, deductions from a smooth curve 
in question. From (13) 
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g(n) = 10* = 10*(''«>+''‘*n“®. (14) 

Let Q denote the total number of fatal quarrels in the range, then 

Q = = 10*('‘*>+*'‘''2»“° (15) 

n n 

So from (12) and (15) 

Q = R10‘'-»2»"'/ Si- (16) 

The coefficient of B will be called the correcting factor. Two verifica¬ 

tions of (16) may be noted; (i) if c=0, then Q=R. (ii) if the range of 
quarrel-dead contained only a single integer n, and if it were located at 
Ho so that 10'‘'>=n, then (16) would become Q—Bn'%~‘/1=R, as it 
should. 

The sum in (16) was evaluated by a method connected with the “in¬ 
tegral test” for the convergence of series, in the following maimer. Let 

ft = lO^o+i/a ^ _ iqmo- 1/2 fop brevity. (17) 


In the following chain of equations, the first is an approximation which 
improves as ho increases, and the other are accurate. 

J .6 1 

=-(5^”® — 0^“®) 

a 1 C 

{ l 0( i -«)/2 - 10(«-»/*}. (18) 


In the special case of c=0, (18) gives the approximation 

1 = 10'‘»(10i'* - 10-1/*) = b-a. (19) 


Strictly 2^nl is the integer next below 6—a; but it is advisable to use 
the same type of approximation in both numerator and denominator of 
(16). Insertion of (18) and (19) into (16) gives the correcting factor 


Q/R== 


1 


1 — c 


IQl/* - 1(1-1/* 


( 20 ) 


This expression is unity when c=0 or c=2. 
For the wars, c=1.50 and Q/iZ=0.855. 
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The observed quantity is Q in the unit range of magnitude. The 
curve should however be connected to ju at a point, and so to 22. We 
have 

logio 22 = logio Q — logic 0.855 = logic Q + 0.068. 

In the graph {Figure 1) the sloping line for the wars was therefore 
raised 0.068 above that which best Jits the observations grouped in unit 
ranges of magnitude. 

In the unexplored region we have, on taking the first approximation 
22 from the previous table; 


World-totals for the 126 years 1820 to 1945 


Range of 
magnitude 

d0 

""" dn 

Correcting 

factor 

Q/R 

Corrected number 
of iaXal 
quarrels 

Q 

Range of number 
killed in a 
quarrel 
n 

0.5tol.5 

3.14 

1.92 

397,000 

4 to 31 

1.5to2.5 

2.32 

1.16 

5,630 

32 to 316 

2.5 to 3.5 

1.87 

0.95 

354 

317 to 3162 


The fourth column shows the improved estimates for the unexplored re¬ 
gion between the wars and the murders according to the circular arc in 
Figure (1). The difficulties of direct counting have already been de¬ 
scribed, and they are emphasized by these large numbers. In the range 
where 354 are expected, I have counted 188. If the straight line in Fig¬ 
ure (1) were accepted instead of the circular arc, 354 would be increased 
to 2530. Although I know that the search is incomplete, I am unable to 
believe that less than a tenth have been found; and so I prefer the circu¬ 
lar arc. 

THE TOTAL NUMBER OP PERSONS WHO DIED BECAUSE OP QUARRELS 
DURING THE 126 TEARS PROM 1820 TO 1945 A.D. 


Ends of range 
of magnitude 

Total number of 
deaths in millions 

How computed 

7±i 

36 ] 


6±i 

6.7 

By summation over a hst of fatal quarrels. 

6±t 

8.4 


4±i 

0.76J 


3±| 

o.sol 

From the circular arc in Figure (1) by the 

2±i 

0.40f 

method desenbed below. 

1±* 

2.2 J 


o±i 

9.7 

From page 528 

\ 

Total 59 
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A remarkable feature of the above table is that the heavy loss of life 
occurred at the two ends of the sequence of magnitudes, namely the 
World Wars and the murders. The small wars contributed much less 
to the total. The total deaths because of quarrels should be compared 
with the total deaths from all causes. There are particulars given by de 
Jastrzebski [9], and in the Statistical Year Books of the League of Na¬ 
tions, which allow the total to be estimated. A mean world population 
of 1.5X10® and a mean death rate of 20 per thousand per year would 
give during 126 years 3.8 X10® deaths from all causes. Of these the part 
caused by quarrels was 1.6 per cent. This is less than one might have 
guessed from the large amount of attention which quarrels attract. 
Those who enjoy wars can excuse their taste by saying that wars after 
all are much less deadly than disease. 

The method of computation in the unexplored region was an exten¬ 
sion of that explained above. The number D of dead in any range of 
is strictly 

D = = Z (21) 

n n 

First a common-sense estimate, E olD was obtained from the graph at 
the midpoint /to of the range /to— i ^/t^j“o+| thus, by (12), 

E = 10'‘»i2 = 10'‘'>+*(»‘»>53 1. (22) 

n 

Then E was corrected by multiplication by a factor wliich was found to 
be approximately 

2 - cl IQI/* - 10-‘/* j' 

The theory of this correction is based on the approximation (13), whence 
it follows that 


/t •+• ^(/t) = C/to + ^0 + /t(l — c). 

(24) 

From (21) and (24) 


n 

(25) 

D 

— = -- 

E 2l 

(26) 
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The sums in (26) were obtained from (18) by suitable alterations of the 
index and they lead to (22). The correction (22) was verified in particu¬ 
lar cases by the “deferred approach to the limit” of Richardson and 
Gaunt [14]. 


COMPAEISON WITH THE MSTBIBUTION OP SIZES OP TOWNS 

Towns and wars are both examples of human aggregation. A referee 
has asked me to compare them. For simplicity 1 take the distribution 
of towns from Lotka’s book [lOA] in Auerbach’s idealized form whereby 
the town of rank r in a given country has a population n such that 

nr = A, (1) 

in which A is a constant, namely the population of the largest town. 
The desired comparison might conceivably be made by arranging the 
fatal quarrels in order of rank; but there would be many artificially 
coincident ranks on account of the rounding off of imperfectly known 
casualties; moreover the gap in the data prevents the extension of rank 
from the wars to the murders. It is preferable therefore to leave the 
fatal quarrels as they are already shown on Figure 1, and to transform 
Auerbach’s law so as to see how towns would appear on a diagram of 
that type. For the purpose of comparison the same symbols will be 
used in corresponding meanings. Thus in this section 

n is the population of a town, and n will be logio n. (2) 

Again, to match the fatal quarrels, q will here be defined as 

number of towns in a range of population 

q — —--(3) 

number of integers in that range of population 

Also, as before, let 

4> = logic q. (4) 

Consider two towns of ranks r’, r", of which r” is the greater. Let them 
mark the ends of a range. The number of towns inside the range is 
strictly But let us follow the usual statistical practice of re¬ 

garding half of an end-object as lying on either side of the end. The 
number of towns in the range is then simply r"—r'. Let n' and n" be 
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the respective populations. With the same convention about ends on 
the population range, we have from (3) 


2 = 



(5) 


Elimination of ranks between (5) and (1) gives 


So that from (4) 


5 = 


A 

n'n" ‘ 


( 6 ) 


<f> = logic A - logic n' - logic n”. (7) 


As for the wars, let no be the midpoint of the range of ju. It is 

no = i(logic n' + logic »")• (8) 

From (7) and (8) 

<l> = logic A — 2no) (9) 

however wide or narrow the range may be. That the constant logic A 
should be the same whatever the grouping is a peculiarity of the slope 

d^/dno = — 2 ( 10 ) 


as shown on page 541, where the correction vanished for c=2. 

This slope for the towns is quite dose to that of the straight line which 
in Figure 1 might join the murders to the world wars. Such a broad resem¬ 
blance between two forms of aggregation is certainly interesting, and 
may suggest theories. Perhaps the most suggestive formula is 

q = An-* (11) 

which follows from (6) when n'is almost equal to n". Rashevsky [llA], 
who gives formula (11), has tried-out several explanations. 

These overall resemblances do not however conduce to an accurate 
description of fatal quarrels; for they distract attention from the curva¬ 
ture of the 4>(p) graph in Figure 1. The slope of the part relating to wars 
is certainly not —2. If it were so, then the number of fatal quarrels 
would increase 10 times for each unit decrease of magnitude. The ob¬ 
served ratio is less than four. 



546 


AMERICAN STATISTICAL ASSOCIATION JOURNAL DECEMBER 1948 


REFERENCES 

[1] Anderson, J. G. 1939. China fights for the world. Kcgan Paul, London, p. 106. 
[lA] Carr-Saunders, A. M. 1936. World Population. Clarendon Press, Oxford. 

[2] Coupland, R. 1939. The Exploitation of East Africa 1856-1800. Faber & 
Faber, London. Chh. VII, XI. 

[3] Durbin, E. F. M., and Bowlby, J. 1939. Personal Aggressiveness and War. 
Kegan Paul, London. 

[4] Fisher, R. A. 1922. Phil. Trans. Roy. Soc. Lond. A 222, 332. 

[5] Glover, E. 1933 & 1947. War, Sadism & Pacifism. Allen & Unwin, London. 

[6] Harding, D. W. 1941. The Impulse to Dominate. Allen & Unwin, London. 

[7] Hume, M. A. S. 1900. Modem Spain. Fisher Unwin, London. 

[8] Japan and Manchouhuo Year Booh, 1938. Tokio. 

[9] Jastrzebski, S. de. 1929 article Death-Rate in Ency. Brit. XIV edn. 

[101 Jeffreys, H. 1939. Theory of Probability, p. 148. Clarendon Press, Oxford. 
[lOA] Lotka, A. J. 1925. Elements of Physical Biology. Williams & Wilkins, Balti¬ 
more. 

[11] Miliukov, P. at tide Russia in Ency. Brit. XIV ed., 19,720 d. 

[11 A] Rashevsky, N. 1947. Mathematical Theory of Human Relations. The Prin- 
cipia Press, Bloomington, Indiana. 

[12] Richardson, L. F. in Nature of 15 Nov. 1941. 

[13] Richardson, L. F. 1945. J. Roy. Statistical Soc. 107,242-250. 

[14] Richardson, L. F., and Gaunt, J. A. 1927. Phil. Trans. Roy. Soc. Lond. 
A 226, 299-361. 

[15] Sleeman, W. H. 1836. Ramaseeana. Military Orphan Press Calcutta, p. 39. 

[16] Thrasher, F. M. 1927, The Gang. Chicago University Press. 

[17] Whitaker’s Almanack, annual, I^ondon. 

[18] Wright, Q, 1942. A Study of War. Chicago University Press. 



BANK RESERVES AND BUSINESS FLUCTUATIONS* 


Clam Warburton 
Federal Deposit Insurance Corporation 

The belief that severe business fluctuations are caused pri- 
luarily by the operations of banks, through their influence on 
the quantity of the circulating medium, was widely held dur¬ 
ing the nineteenth century and the first three decades of the 
twentieth. During the latter part of this period variations in 
bank reserves were believed to bo the chief causal factor un¬ 
derneath changes in the circulating medium furnished by the 
banks. This paper describes a study of factual data Since 1918 
in Ihe light of that theory. To make the investigation, it was 
necessary to develop several new time scries from available 
statistical mateiial, and to make appropriate allowance for 
growth trends. The results support the theory. There is a case 
by case correspondence between the peaks and troughs in the 
scries of effective member bank reserves and the business cycle 
reference dates of the National Bureau of Economic Bescarch, 
with leadership in reserves; and the duration and depth of 
periods of recession and recovery intervening between the 
turning points are closely associated with the duration and 
depth of deviations in reserves from the line of growth. 

I WOULD like to begin with two quotations. The first is from Harry E. 
Miller’s book, Banking Theories in the United States before 1860. 

"The most common explanation of commercial crises. . . was that 
they result from the alternating expansion and contraction of the media 
of payments which occurs wherever a currency of convertible bank 
notes exists.”^ 

The second is from Paul Barnett’s book, Business^Cycle Theory in the 
United States from 1860 to 1900. 

“Monetary theories of cyclical changes in business activity received 
more widespread consideration by writers in the United States from 
1860 to 1900 than any other single group of theories.”* 


* Talk before the Washington Statistical Society, April 26,1048. 

1 Harry E. Miller, Banking Theories in the United States Wore 1860 (Harvard University Bcess, 
1927), p. 81. 

* Paul Barnett, Buamess-Cvele Theory in the United States^ 1860-‘1^X) (Usivexsity of Chioago Brass, 
1941), p. 86. 
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The belief that severe business fluctuations are caused primarily 
by the operations of banks, through their influence on the quantity of 
the circulating medium, continued to be widely held during the first 
three decades of the present century. During this period and the latter 
part of the nineteenth century an important supplement to the theory 
was developed, namely, that variations in bank reserves are the chief 
causal factor underneath changes in the circulating medium furnished 
by the banks. In addition, the need for continuous increase in the quan¬ 
tity of circulating medium because of increasing population and pro¬ 
ductivity was given emphasis; the possibility that habits of use of cir¬ 
culating medium may change was recognized; and various aspects of 
the routes or modes of impact of bank operations were explored. 

To describe with precision aU phases of this banking theory of busi¬ 
ness fluctuations cannot be attempted here. My endeavor to provide a 
concise but reasonably complete statement of the theory has yielded 
the following six points. 

1. Under a fractional reserve system, changes in the amount of com¬ 
mercial bank reserves alter the quantity of deposits and therefore the 
quantity of assets which such banks may hold; and this is followed by 
changes in the actual volume of assets and deposits, because banks of 
deposit have an inherent tendency to expand their operations to the 
limit placed upon them by law and interbank relationships. Since de¬ 
posits form the bulk of the circulating medium, changes in deposits dom¬ 
inate changes in the total circulating medium. 

2. Changes in the amount of circulating medium, expressed as de¬ 
viations from a normal rate of increase accompan 3 dng growth in popu¬ 
lation and productivity and adjusted for any measurable trend in 
monetary habits, alter the equilibrium center of the price system. That 
is, deficiency in the circulating medium relative to a reasonable rate of 
growth produces a falling price level because the output of the economy 
cannot continue to be sold at prevailing prices; while expansion at an 
excessive rate gives sellers of goods an opportunity to obtain higher 
prices and the desire for profits produces this result. 

3. A falling price level, in view of fixed and sticky costs, narrows 
profit margins and induces business to reduce output; while a rising 
price level tends to increase profit margins and therefore stimulates 
output. In addition, reluctance or inability of some business enterprises 
to make rapid dianges in seUiag prices leads to unanticipated inventory 
accumulation when the equilibrium center of prices falls and to di¬ 
minished inventories when the equilibrium center rises. Such changes in 
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inTentories, as well as changes in prices, induce businessmen to curtail 
or expand output. 

4. The foregoing effects are likely to be accentuated by changes in 
velocity of the circulating medium. Contraction of the circulating 
medium is per se a shrinking of the cash holdings of business and individ¬ 
uals, and this leads to more hesitant and cautious use of them. On the 
other hand, rapid expansion of cash balances is followed by a tendency 
to shorten the period that cash receipts are held before their disburse¬ 
ment. 

5. An additional route of impact of changes in bank reserves is 
through accompanying changes in rates of interest and in the availabil¬ 
ity of credit for those who wish to borrow. Since those changes may be 
synchronous with the beginning of a deviation of bank reserves from 
trend, the effects of such deviations via this route of impact may pre¬ 
cede those via the other routes which I have described. 

6. Causation sequences are normally accompanied by time lags. 
Two such logs between changes in bank reserves and sequential busi¬ 
ness fluctuations are important: (a) that which may occur between the 
appearance of the forces operating to change the quantity of reserves 
and the response of commercial banks to those changes; and (b) that 
which may occur between changes in the quantity of circulating medium 
and changes in the level of prices and quantity of output. These time 
lags vary in length. Under some circumstances the total lag may be of 
negligible duration; under other circumstances as much as or more than 
a year. 

This hypothesis, as I have indicated, was the dominant aspect of 
business-fluctuation theory in the nineteenth and early part of the 
twentieth centuries. However, it was far from universally held, and the 
validity of various parts of it was challenged by some economists. 
Efforts at testiog it by factual analysis were mconclusive because of 
inadequacy of data. In recent years the hypothesis has been displaced 
by other theories to such a great extent that few economists have 
deemed it worthwhile to examine m its light the more abimdant data 
now available. My purpose in offering this paper is to describe briefly 
the statistical methodology which I have used in such an examination 
and to state the results. 

Scope of staMstical analysis 

Variations in the value, quantity, and prices of goods and services 
sold to final purchasers, in the form of deviations from appropriate 
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trends, were deemed to constitute the basic phenomena of business 
fluctuations. These, together with the quantity of circulating medium 
(measured by total deposits and currency excluding interbank obliga¬ 
tions), habits of use of circulating medium as expressed in velocity, 
and bn-nk reserves were selected as the principal statistical scries to ^ 
examined. 

The period from 1918 to date was chosen for the study for two rea¬ 
sons. First, since June 1917, required reserves of banks which are mem¬ 
bers of the Federal Reserve System have consisted exclusively of de¬ 
posit balances iu the Federal Reserve Banks. Second, most of the 
necessary statistical series were unavailable for prior dates, nor could 
reasonably reliable series be constructed from the available mforma- 
tion. 

Quarterly averages were used in the study. On the one hand annual 
data are not sufficiently sensitive to the variations commonly desig¬ 
nated “cyclical”; and on the other, monthly data are more difficult to 
work with over a period of three decades and are also subject to short- 
time variations which may be neglected. 

Statistical series 

Tihristing statistics did not provide suitable basic series and these had 
to be constructed. With the exception of bank reserves, the methods 
of constructing them are described in articles which have been pub¬ 
lished in economic journals.* These series have since been revised, with 
slight changes in methodology. With respect to bank reserves, reported 
data for Federal Reserve member banks were adjusted to quarterly 
averages, and then adjusted successively for changes in percentage re¬ 
serve requirements, for shifting of deposits among member banks sub¬ 
ject to different percentage requirements, and for changes in the propor¬ 
tion of deposits held by member and nonmember banks. These adjust¬ 
ments, together with the resulting series of effective bank reserves, are 
shown in Table 3, at the end of this article. 

Determination of trends 

For initial determination of trends in sales of final products and in 
the quantity and rate of use of circulating medium, the period of great¬ 
est stability of output and prices during the three decades was selected. 

* "Normal Fjeoduetion, Income, and Employment, 1945 to 1965,” The Smtihem Economic Journal, 
Vol. XI, No. 3 (January-1945), pp. 219-45; "The Volume of Money and the Price Level Between the 
World Wars,” The Journal of Pditical Economy, VoL UIl, No. 2 (June 1945), pp. 150-63; and "Quan¬ 
tity Freqaency of Use of Money in the United States, 1919-45,” The Journal of PdiHcal Economy, 
Vol. LTV, No. 5 (October 1946), pp. 436-50. 
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At first the seven-year period, 1923-1929, was used, but in view of de¬ 
viations which occurred in 1929 the period was shortened to tihe six 
years, 1923-1928. Percentage growth trends were used, based initially 
on averages for the terminal years of the period, 1923 and 1928, both 
years of high production or “full employment” as that phrase is now 
used. 

The trends thus derived were drawn on semi-logarithmic charts of 
the data for the entire period covered by the study, and were found to 
be excellent fits for this period. In addition, the trend in quantity of 
final products sold was compared with an annual index of production 
covering the period since 1879; and the trend in circuit velocity of de¬ 
posits and currency was compared with the ratio of national income to 
deposits and currency by decades for the period since 1799. The appar¬ 
ent trends for the recent and the earlier decades were similar, and their 
validity as genuine secular trends was deemed to be established. 

Several minor variations in the slopes of the trends were examined 
to see which provided the best theoretical and practical fit. The final 
selection took into account the fact that the level of prices of fina-l prod¬ 
ucts was lower in 1928 than in 1923 by about 2 per cent, and the prin¬ 
ciple that the trend values should be consistent with each other when 
projected forward for a considerable period of time. The slopes chosen 
were as follows: 

Prices of final products: none. 

Quantity and value of final products: 3.6 per cent growth per year, 
or .888 per cent per quarter. 

Circuit velocity of deposits and currency: inverse of a growth in de¬ 
posits and currency relative to expenditures for final products of 
1.35 per cent per year, or .336 per cent per quarter. 

Quantity of deposits and currency: 5 per cent growth per year, or 
1.227 per cent per quaHcr (when expressed as amounts, the 
product of the above two trends). 

Bank reserves; same as quantity of deposits and currency. 

In the final revision, the selected quarterly tre^d rates were applied 
to the 25-quarter period from the last quarter of 1922 to the last quarter 
of 1928, inclusive. Trend values were computed from the quarter in the 
center of the period. 

DevicUionsfrom trend 

The quarterly statistical series were next compared with trend values. 



552 AMERICAN STATISTICAL ASSOCIATION JOCBNAL DECEMBER 1948 

Points of upward and downward departure from trend, and of maxi¬ 
mum deviation, were noted; and also differences in timing with respect 
to turning points and departures from trend in the various series. In 
the case of the transition from the prosperity of the 1920's to the great 
depression, the timing of departures from trend, on the down side, were 
as follows: 

Bank reserves, and deposits and currency—^third quarter of 1928 
Value of sales of final products—second quarter of 1929 
Quantity of sales of final products—^fourth quarter of 1929 
Prices of final products—^first quarter of 1930 
Circuit velocity of deposits and currency—third quarter of 1930 

This sequence was found to be typical, though not perfectly uniform, 
and with considerable variation in the length of the lags. 

One of the important results is the fact that deviations from trend 
in circuit velocity—^particularly in the case of the beginnings of down¬ 
swing—^lag behind deviations from trend in quantity of deposits and 
currency, and except for this lag are in the same direction. This result 
is in conformity with traditional theory that variation in velocity re¬ 
sults from and accentuates the effect of variation in the quantity of 
circulating medium. The assumption which is made in Keynesian and 
other contemporary theories that idle money is an independent or lead¬ 
ing factor in the course of business fluctuations is inconsistent with the 
observed facts. 

Comparison of turning points with business cycle reference dates 

The last element in the statistical methodology to be mentioned here 
was a comparison, for the period 1918-1939, of turning points in bank 
reserves and in deposits and currency with each other and with refer¬ 
ence dates for business cycles. For turning points in bank reserves and 
in deposits and currency, downswings and upswings relative to trend 
of one quarter’s duration only were neglected. Other turning points 
were divided into two groups. Those preceded and succeeded by an up¬ 
swing or downswing of a year or longer were defined as primary turn¬ 
ing points; those preceded or succeeded by a downswing or upswing of 
two or three quarter’s duration were termed secondary turning points. 
The secondary turning points occur in pairs between a pair of primary 
turning points. 

For the period from 1918 to 1939 five pairs of primary turning points 
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aad six pairs of secondary turning points occurred in bank reserves. Of 
the latter, four pairs showed slight amplitude, that is, the difference 
between the peak and trough expressed as deviations from trend is 
about one per cent or less; the other two were of considerably greater 
severity. 


TABLE I 

TUENINQ POINTS IN MEMBEE BANE BESEBVES AND IN DEPOSITS AND 

cuhbency compabed with business cycle eefebence 

DATES, QUABTEBLY DATA, 1618-1939 


Turning 

points 

Effective member 
bank reserves* 

Deposits and 
currencyt 

Business cycle 
reference dates t 

Peak 

2nd 1918 


3rd 1918 

Trough 

4th 1918 

.. 

2nd 1919 

Peak 

Ist 1920 

2nd 1920 

1st 1920 

Trough 

4th 1921 

1st 1922 

3rd 1921 

Peak 

1st 1923 

2nd 1023 

2nd 1923 

Trough 

2nd 1924 

2nd 1924 

3rd 1924 

Peak 

4th 1925 

4th 1925 

3rd 1926 

Trough 

1st 1927 

let 1927 

4th 1927 

Peak 

2nd 1928 

2nd 1928 

2nd 1929 

Trough 

1st 1932 

,, 

Summer 1932 

Peak 

4th 1932 


End of 1932 

Trough 

2nd 1933 

2nd 1933 

1st 1933 

Peak 

4th 1935 

3rd 1936 

2nd 1937 

Trough 

3rd 1937 

2nd 1938 

2nd 1938 


* Based on deviations from trend, at 5 per cent per year rate of growth, in average member bank 
reserves adjusted for changes in percentage reserve requirements, shifting of deposits among member 
banks subject to different percentage requirements, and changes in the proportion of deposits held by 
member and nonmember banks. Includes all peaks and troughs in this senes preceded by upswings and 
downswings, respectively, of a year or longer, and all peaks and troughs preceded or followed by an up¬ 
swing or downswing of two or three quarters* duration with an amplitude of 2 per cent or more. 

t Based on deviations from trend in total deposits and currency, excluding interbank obligations— 
with the same criteria as for member bank reserves. In the omitted cases the quarterly estimates of 
deposits and currency do not show definite peaks and troughs, though changes occurred in the rate of 
expansion or contraction relative to trend. 

t Arthur F. Bums and Wesley C, Mitchell, Meawring Business Cycles (1946), 78 and 82. The 
dates given hero include all of the reference cycles recognised by Bums and Mitchell for the period 
subsequent to January 1,1918, and also the intermediate cycle of short duration but large amplitude 
in 1932, which they mention but treat as a double-bottomed trough without assignment of specific 
reference dates. 


The list of turning points in member bank reserves given in Table 1 
includes all primary turning points and the two pairs of secondary turn¬ 
ing points with considerable amplitude. The business cycle reference 
dates include all those given for the period since 1918 by Professors 
Bums and Mitchell of the National Bureau of Economic Research, to¬ 
gether with the intermediate cycle of short duration and substantial 
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amplitude in 1932 which they mention but treat as a double-bottomed 
trough.* 

It will be noted that there is a case-by-case correspondence between 
the 14 turning points in member bank reserves during the period from 
1917 to 1939 and the same number of National Bureau business cycle 
reference dates. The peaks and troughs in member bank reserves pre¬ 
ceded the business cycle peaks and troughs in ten of the fourteen cases, 
were simultaneous in two cases, and lagged by one quarter in two cases. 
However, in the last two cases the specific data are such as to make un¬ 
warranted a conclusion that in those cases the recovery in business 
activity preceded the recovery in reserves. 

TABLE 2 

BEIATIONSHIF OF DEOBEE OF CHANGE IN THE EFFECTIVE AMOUNT OF 
M EM BER BANE RESERVES TO PERIODS OF BUSINESS 
RECOVERY AND RECESSION, 1918-mi» 


Approximate period of expansion or 
contraction in reserves 

Approximate period of recession 
or recovery 

1918-1919, rapid expansion in reserves 

1920-1921, contraction in reserves 

1922, rapid expansion in reserves 

1923, insufficient growth in reserves 

1924, rapid expansion in reserves 

1925, normal growth in reserves 

1926, contraction in reserves 

1927, rapid expansion in reserves 

1928-1930, insufficient growth in reserves 

1931 and early 1932, drastic contraction in re¬ 
serves 

Summer aud autumn of 1932, substantial expan¬ 
sion in reserves 

Early 1933, drastic contraction in reserves 

Middle 1933 to end of 1935, rapid growth in re¬ 
serves 

1936-1937, drastic contraction in reserves 
1938-1940, rapid expansion in reserves 

1919 and early 1920, business boom 

Late 1920 and 1921, severe depression 

1922 and 1923, prosperity 

1924, minoi depression 

1925, boom 

1926, continuation of prospexity 

1927, depression 

1928 and part of 1929, prosperity 

Late 1929 and 1930, depression 

1932, intensified depression 

Latter part of 1932 beginning of recovery 

Same period, renewed depression and collapse of 
the banking system 

Mid-1033 to 1037, continuous business improve¬ 
ment 

Late 1937 and early 1938, depression 

Late 1938 to Pearl Harbor, prosperity. 


* For description of the concept of effective member bank reserves, see footnote to table 1. 


The relationship of bank reserves to business fluctuations is not lim¬ 
ited to the sequence of turning points; the duration and depth of the 
periods of recovery and recession intervening between the turning points 
are closely associated with the duration and depth in deviations in 
effective member bank reserves from the 5 per cent per year line of 
growtli. The sequences are shown in Table 2. 


4 Arthur F. Bums and Wesley C. Mitchell, MeojavHng Bubineaa Cydes (National Bureau of Eco¬ 
nomic Beeeardi. 1946), pp. 78 and 82. 
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TABUBS 

MEMBEE BANE DESERVES, ANNUAL AND QUABTEBLT AVERAGES, Ulg-1«47 


Year and 
smarter 

Amouni in TnllUmm of dollars 

Rdatives for computation of effective 
amount (1923-1928 bb1) 

Percentage 
ratio to 
trend of 
effective 
amount of 
reserved 

Actual amount 

Effective 

amount 

(1923-28 

basis)* 

Percentage 

requirement* 

Deposit 

shifting* 

Depotits not re¬ 
quiring reserve 
and interbank 
deposHe* 

Total* 

Required* 

Eicesa 

1918 

1,496 

1,889 

114 

1,487 

LOO 

L1S8 

1.119 

96.8 

1 

1,467 

1,377 

90 

1,411 

1.00 

1.163 

1.119 

96.3 

ii 

1,499 

1,298 

201 

1,480 

1.00 

1.166 

1.141 

99.9 

iU 

1,471 

1,319 

152 

1,432 

1.00 

1.154 

1.123 

95.4 

iv 

1,548 

1,535 

13 

1,424 

1.00 

1.158 

1.065 

93.7 

1919 

1,718 

1,686 

89 

1,600 

1.00 

1.188 

1.069 

109.1 

I 

1,633 

1,559 j 

74 

1,479 

1.00 

1.154 

1.045 

96.2 

B 

1,679 

1,607 

72 

1,561 

1.00 

1.145 

1.065 

100.3 

iii ! 

1,743 

1,652 

91 

1,644 

1.00 

1.133 

1.069 

104.3 

iv 

1,817 

1,726 

91 

1,717 

1.00 

1.119 

1.067 

107.6 

mo 

1,886 

1,779 

68 

1,748 

LOO 

1.099 

L0S7 

106.0 

I 

1,873 

1,771 

102 

1,766 

1.00 

1.107 

1.044 

109.3 

ii 

1,859 

1,804 

55 

1,749 

1.00 

1.097 

1.032 

107.0 

iii 

1,821 

1,808 

13 

1,727 

1.00 

1.087 

1.031 

104.4 

iv 

1,786 

1,705 

80 

1,730 

1.00 

1.076 

1.042 

103.3 

mi 

1,679 

1,698 

49 

1,666 

LOO 

1.060 

1.049 

96.8 

I 

1,732 

1,642 

90 

1,709 

1.00 

1.064 

1.060 

100.8 

u 

1,662 

1,629 

33 

1,667 

1.00 

1.067 

1.054 

96.5 


1,630 

1.686 1 

44 

1,620 

1.00 

1.058 

1.051 

93.2 

iv 

1,663 

1,636 1 

27 

1,636 

1.00 

1.059 

1.042 

93.0 

1999 

1,781 

1,788 

48 

1,718 

1.00 

L0B9 

1.016 

94.7 

1 

1,702 

1,641 

61 

1,669 

1.00 

1.056 

1.029 

93.1 

ii 

1.779 

1,739 

40 

1,710 

1.00 

1.054 

1.013 

94.9 

Ui 

1,807 

1,786 

22 

1,730 

1.00 

1.053 

1.008 

94.8 

iv 

1,834 

1,787 

47 

1,774 

1.00 

1.046 

1.012 

96.0 

ms 

1,87S 

1,894 

49 

1,878 

LOO 

1.019 

1.019 

98,8 

1 

1,897 

1,821 

76 

1,871 

1.00 

1.031 

1.017 

100.1 

a 

1,870 

1,833 

37 , 

1,869 , 

1.00 

1.021 

1.020 

98.7 

iii 

1,860 

1,820 

30 

1,862 

1.00 

1.014 

1.021 

97.2 

iv 

1,874 

1,832 

52 

1,889 

1.00 

1.011 

1.019 

97.4 

1994 

9,09$ 

1,969 

61 

1,989 

LOO 

1.019 

LOOS 

99.6 

1 

i.«oo 

1,837 

69 

1,913 

1.00 

1.011 

1.015 

97.5 

ii 

1,943 

1,903 

41 

1,929 

1.00 1 

1.015 

1.008 

97.1 

iii 

2,079 

1.997 

82 

2,080 

1.00 

1.023 

.999 

100.9 

iv 

2,162 

2,112 

50 

2,082 

1.00 

1.027 

.089 

102.3 

1995 

9,166 

9,180 

86 

9,184 

LOO 

1.010 

.996 

101.7 

i 

2,163 

2,104 

59 

2,101 

1.00 

1.016 

.987 

101.9 

a 

2,132 

2,102 

80 

2,109 

1.00 

1.007 

.996 

101.1 

iii 

2,167 

2,115 1 

42 

2,140 

1.00 

1.007 

.999 

101.3 

iv 

2,214 

2,200 

14 

2,187 

1.00 

1.009 

.997 


1998 

9,909 

9,191 1 

18 

9,919 

LOO 

.999 

1.000 

m.s 

i 

2,214 

2,197 

17 

2,201 

1.00 

1.004 

.998 

101.7 

a 

2,196 

2,172 j 

24 

2,207 

LOO 

.996 

1.001 

100.7 

m 

2,208 

2,184 

24 

2,216 

1.00 

.998 

1.002 

99.9 

iv 

2,217 

2,210 

7 

2,222 

1.00 

.997 

.999 

99.0 
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TABLE 3 (CONTINUED) 
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TABLE 3 iCQNTlNUBD) 


Year and 
quarter 

Amount b miUionB of dollars 

Rdatives for computation of effective 
amount (1923-1928:*»1) 

Percentage 
ratbto 
trend of 
effective 
amount of 
reserves* 

Actual amount 

Effective 

amount 

(ig23>28 

basis)* 

Percentage 

requirement* 

Deposit 

sbiftbg* 

Depodts not re^ 
quiring reserve 
and interbank 
deposits* 

Totali 

Required* 

Excess 

me 

5,988 

3,476 

9,519 

4,008 

1.95 

1.185 

.998 

111.6 

I 

6,669 

2,761 

2,908 

4,666 

1.00 

1.131 

.931 

132.4 

ii 

5,474 

2,840 

2,634 

4,477 

1.00 

1.131 

.925 

125.5 

iii 

6,129 

3,723 

2,406 

3,307 

1.50 

1.137 

.920 

91.6 

iv 

6,681 

4,579 

2,103 

3,583 

1.50 

1.140 

.917 

98.0 

iosr 

e,8t9 

5,603 

1,996 

9,910 

1.94 

1.119 

.991 

77.9 

i 

6.722 

4,850 

1,872 

3,103 

1.75 

1.134 

.916 

83.8 

ii 

6,878 

6,759 

1,118 

2,817 

2.00 

1.122 

.919 

75.2 

iii 

6,800 

5,958 

842 

2,820 

2.00 

1.113 

.923 

74.3 

iv 

6,917 

5,845 

1,073 

2,899 

2.00 

1.106 

.927 

75.5 

19S8 

7,931 

5,41S 

9,516 

8,881 

1.81 

1.190 

.995 

91.7 

i 

7,246 

5,818 

1,428 

3,043 

2.00 

1.106 

.929 

78.3 

ii 

7,645 

5,192 

2,453 

3,650 

1.75 

1.112 

.929 

92.8 

iU 

8,161 

5,194 

2,967 

3,837 

1.75 

1.123 

.924 

96.4 

iv 

8,673 

5,458 

3,215 

3,993 

1.75 

1.137 

.916 

99.1 

1989 

10,su 

5,957 

4,887 

4,571 

1.75 

1.159 

.898 

100.0 

i 

8,992 

5,562 

3,430 

4,101 

1.75 

1.139 

.909 

100.5 

U 

9,902 

5,774 

4,128 

4,449 

1.76 

1.151 

.905 

107.7 

iii 

10,808 

6,072 

4,736 

4,735 

1.75 

1.166 

.894 

113.3 

iv 

11,674 

6,421 

5,253 

4,999 1 

1.75 

1.181 

.885 

118.1 

1940 

13,946 

6,999 

6,894 

l,4SS 

1.75 

1.197 

.868 

195.6 

i 

12,187 

6,579 

5,608 

5,141 

1.75 

1.187 

.876 

120.0 

ii 

13,128 

6,799 

6,329 

5,468 

1.75 

1.195 

.871 

126.1 

iU 

13,595 

7,015 

6,580 

5,595 

1.75 

1.201 

.865 

127.4 

iv 

14,074 

7,294 

6,780 

5,729 

1.75 

1.206 

.859 

128.9 

19JH 

18,407 

8,078 

5,899 

5,906 

1.81 

1.919 

.849 

113.6 

i 

14,105 

7,586 

6,519 

5,679 

1.75 

1.212 

.854 

126.2 

U 

13,483 

7,796 

5,688 

5,394 

1.76 

1.214 

.850 

118.4 

iii 

13,103 

7,994 

5,109 

5,224 

1.75 

1.214 

.847 

113.3 

iv 

12.936 

8,935 

4,001 

4,525 

2.00 

1.208 

.845 

97.0 

1949 

19,649 

9,m 

9,678 

4,US 

9.00 

1.187 

.896 

91.7 

1 

12,890 

9,590 

3,301 

4,456 

2.00 

1.215 

.840 

94.3 

U 

12,594 

9,820 

2,774 

4,293 

2.00 

1.219 

.831 

89.8 

iu 

12,444 

10,182 

2,262 

4,185 

2.00 

1.222 

.822 

86.5 

iv 

12,668 

10,313 

2.356 

4,719 

2.00 

1.090 

.812 

96.3 





A B 



A B 

A B 

1943 

19,698 

11,117 

1,511 

5,107 4,798 

9.00 

1.080 

.874 -890 

101.1 94.9 

i 

13,177 

11,268 

1,909 

4,982 4,982 

2.00 

1.095 

.828 .828 

100.4 100.4 

U 

12,345 

10,631 

1,713 

4,965 4,656 

2.00 

1.079 

.868 .814 

98.9 92.7 

iii 

12.624 

11,330 

1,293 

5,240 4,783 

2.00 

1.077 

.894 .816 

103.1 94.1 

iv 

12,366 

11,238 

1,128 

5,239 4,752 

2.00 

1.067 

.904 .820 

101.8 92.4 

m 

13,990 

19,170 

1,050 

6,107 5,199 

9.00 

1.055 

.978 .899 

115.1 98.0 

I 

12,572 

11,484 

1,087 

5,516 4,918 

2.00 

1.061 

.931 .830 

105.9 94.4 

ii 

13,018 

12,069 

947 

5,981 5,036 

2.00 

1.060 

.974 .820 

113.4 95.5 

m 

13,127 

12,051 

1,074 

6,226 5,206 

2.00 

1.049 

.995 .832 

116.7 97.6 

iv 

14.165 

13,075 

1.088 

6,704 5,636 

2.00 

1.048 

.992 .834 

124.1 101.3 
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TABLE 3 (CONTINUED) 




Amount b mllUons of doUara 

Relatives for computation of effective 
amount (1923-1928sl) 

Percentage 
ratio to 

Year and 
quarter 

Actual amount 

Effective 

amoont 

(1923-28 

basis)! 

Percentage 

requirement* 

Depodt 

shifting 

DopositB not re¬ 
quiring reserve 
and interbank 
deposits* 

trend of 
effective 
amount of 


Total! 

Reqiured! 

Excess 

reserves! 

m 


16,998 


A B 

7,640 6,181 

8.00 

1.068 


A B 

181.8 109.9 

i 

14,173 

13,144 


6,807 5,694 

2.00 

1.038 

.997 .834 


li 

15,064 

13,971 


7,321 6,022 

2.00 

1.038 


132.3 108.8 

In 

15,049 

18,925 


7,422 6,244 

2.00 

1.026 

1.012 .837 

132.5 111.1 

Iv 

15,915 

14,673 


7,810 6,543 

2.00 

1.024 


137,7 116.4 

IBJiS 

15,967 

14,990 

977 

7,664 6,669 



.988 .844 

185.9 114.8 

i 

15,715 

14,556 

1,157 

7,618 6,495 

2.00 


.984 .839 

132.7 113.1 

U 

15,751 

14,776 

975 

7,392 6,680 

2.00 


.947 .843 

127.2 113.2 

m 

16,070 

15,168 

902 

7,285 6,734 

2.00 


.913 .844 

123.8 114.7 

iv 

16,331 

15,458 

873 

7,159 6,867 

2.00 


.882 .846 

120.2 115.5 

1947 

16,457 

15,589 

868 

7,046 6,989 



.857 .850 

114.7 118.8 

1 

16,135 

15,293 

842 

6,962 6,833 

2.00 


.863 .847 

115.5 113.4 

fi 

16,m 


801 

6,894 6,805 

2.00 

.997 

.858 .847 

113.0 111.5 

iU 

16,666 

13,664 

901 

7,066 

2.00 


.854 

114.4 

iv 

17,107 

! 

16,177 

930 

7,250 

2.00 


.851 

115.0 


1 From moatbly averages of daily figures, Banking and Manelafy Staikiiat PP* SfiO-TO and 396, and issues of Fsdaral Bemte BulMn 

im-im. 

>For 1920-1947, Banking and Monelary StatMtia, p. 396, and issues of Federal Bemte BuUdin, 1942-1948, mth amount for first 
q;aarter of 1933 estimated in part from weddy data. For 1919-1928 estimated from (a) required reserves on oall dates nearest June 30 and 
December 31, and (b) ratios of average total deposits other than interbank in reportmg member banks for quarters immediately preceding 
and subsequent to June 30 or December 31 to the amount for the oall date, vith a slight adjustment on the basis of reported average 
required reserves for 1929. Data from Banking and Monetary Statutice, pp. 132-42,369-71,373-75,387,395, and 396 and issues of Federal 
Raerve BtiUetin, 1919. 

* Averagetotalreservesadiustedby relatives of percentage requirements, deposit shifting, and deposits not requiring reserves (divided 
by the first tvro and nmlUpUed by the third). Annual figures are averages of quarterly figures and may differ from those enmputwl from 
the average annual relatives. For the difference between columns A and B, 1943-1947, see footnote 6. 

* Ratio of percentage reserve requirements to suoh requirements during 1923-1928. Change In percentage requirements during a 

quarter are treated as in foree throughout the quarter, in view of the fact that in most Instances the changes vrere in advance, 

l^uotioa in reserve requirements for central reserve city banks only In 1942 is not taken into consideration here (see footnote 5). 

* Proportion (ff net demand plus time deports in member banks b each of four eat^orles (net demand deponts b ecntral reserve dty 
banks, net demand deposits b reeerve eity banks, net demand deposits b country banks, and time deposits), weighted by the relative per¬ 
centage reserve requirements (13,10,7 and 3, respeetivdy, untU June 1942, and 10,10,7 and 8, respeotivdy, subsequent to that date), 
converted to idafives with the average for 1028-1928»1. IheseperoentageBtake into oonsidetation tb change m reserverequfrementsb 
1942 applieahifi only to banks b central reserve cities, Deposit figures for 1929-1947 are from monthly or semi-monthly duly averages, and 
those for 1918-1928 are bterpolationB from deposits on call report dates (from Banking and Monetary StoHAkit pp. 73,81,87,93, and 99; 
annualreporfs of the Board of Covemon of the Federal Reserve System, 1931-1937; andissues of the Federal BaeoreeBvMn, 1938-1948). 

* Interpolated from ratio of total depouts adjusted to total net demand plus time depodis b member banks for June 30 and Deoembw 
31 of each year, converted to rebtivos vrith the average for 1923-1928»>1.0. Data for total deposiis adjusted from Banking and Monetary 
fitstisiioBtpp. 34-36. vtitb figures for December 31,1917-1922, interpolated, and issues of FederalBeeme BuOdin, 1942-1948; data for total 
net demand plus time depouts b member banks from Banking and Monetary Staiitike, pp. 73-75, and issues of Fedoral Beeerte Aiffotin, 
1942-1948. For 1943-1947, fignrei m edumn A refer to data bcluding, those b odumn B to data evfthiding . United States government 
deposits. 

’ Trend values are computed at 1.227 per oent borease per quarter; and are based on the average effective reserves during the 25- 
quarter period from the4th of 1922 to the 4th of 1928, bduave, centred at the middle quarter of that period. 






































THE OKDERING OF n ITEMS ASSIGNED TO k RANK 
CATEGORIES BY VOTES OF m INDIVIDUALS 

Gabbst L. Schuyler 
Rear Admiral, USN {Retired) 

A method is proposed for arranging in order of acceptability 
a set of n items (research projects, candidates for positions, 
articles used, viewed, or smelled) on the basis of judgments of 
m judges. Unlike previous methods, this permits each judge 
independently to group in any manner those items tied for his 
favor, and to omit voting on any items concerning which he 
cannot decide. The sampling probabilities for estimating ac¬ 
curacy are not here considered. 

INTRODUCTION 

W E MAY STATE in the following most general terms, the problem 
faced by a panel of judges ranking items, projects or persons ac¬ 
cording to some immeasurable quality. 


PROBLEM 

Items Pi, P 2 , • • •, P» are to be arranged in order by votes of indi¬ 
viduals Ml, il(f 2 , • • •, Afm. Let Ui be number of items concerning which 
individual is too imdecided to express his opinion, and g., r*, s„ • • • 
numbers of items he assigns to categories a, b, c, • • • of decreasing 
worthiness. Then upon what criterion C/ is the position of item P, 
among the other items best based? 

SOLUTION^ 

Consider a table in which each column corresponds to an item and 
each row to a voter, with letters such as a, 6, c, • • • to indicate the rat¬ 
ings of the several projects, or u for “undecided.” The letters a, 6, c, • • • 
are to be replaced by numbers in the same relative order, and the totals 
of the columns are to be taken to indicate the order of excellence to be 
assigned as a composite judgment to the items. 


^ M. O. TfandaH: Tii. Advaiieed Theory of Statistioa. QtiSn, 1M3, p. 410. 
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For this purpose the first rank numbers to suggest themselves are 
the midranks 


(l/2)(gi + 1), qi + (l/2)(ri + 1), g, + n + (l/2)(8i + !),••• 

in the blocks of items receiving o-votes, 6-votes, c-votes,.. . respec¬ 
tively of member Mi. But the average of the (n—«<) rank numbers 
which Mi assigns on this basis is only (l/2)(n-l-l—«,); and this func¬ 
tion of the number of it-votes varies from member to member. 

To have rank numbers from all members’ votes properly additive, 
the average rank numbers for all members’ votes should be equal. To 
effect this it seems simplest to multiply each member’s midrank num¬ 
bers by an adjustment factor 


Ai = 


n -f 1 
n 4- 1 — 


( 1 ) 


which will make the average rank numbers for o-votes, 6-votes, etc. of 
each member equal to (l/2)(n+l). Also, a rank number (l/2)(n4-l) 
should be assigned for each it-vote. 

If Bai, Bbi, Rei, etc. and Rui are the rank numbers thus assigned to 
items receiving respectively o-votes, 6-votes, c-votes, etc. and it-votes 
of member Mi, we thus have 

Rai = A,il/2)(qi -f 1), 

Bbi = A.(g< -f (l/2)(r.- + 1)), 

Rei — Ai{qi + ri -J- (l/2)(sf -1-1)), 


Rui = (l/2)(n + 1). 

The criterion for item Pj is then the sum Cj of the rank numbers R of 
the votes of all members on this item. 


PLEXIBILITT OF THE METHOD 

These equations do not restrict panel members to the use of any par¬ 
ticular number of categories or ranks, nor do they even require that all 
members use the same series of ranks in the same voting. A member 
may or may not leave the ratings of individual items imdecided. 

The minimum possible number k of different ranks which a member 
may meaningfully assign is obviously 2. When all members use the 
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maxiintun value n of k, the order of the C, is the same as that indicated 
by Kendall^ without consideration of the more general problem here 
described. 

THB CASB k = Z 

Principal interest seems most likely to lie, however, in less idealized 
cases where, unlike judges encountered in the textbook examples, panel 
members will not 

(a) agree to vote on every item (i.e., to use no it-votes), nor 

(b) agree to avoid tie votes (i.e., to use as many ranks as there are 
items). 

Recently, for example, panels of 15-20 voters were required to ar¬ 
range series of 40-122 problems in order of worthiness for retention on 
a possibly reduced research budget. Few members had specialized on 
all parts of the field covered. Nearly every member encountered items 
on which he felt himself insufficiently well qualified to cast a possibly 
decisive vote. 

It was soon agreed that members might withhold opinions on any 
items with which they felt themselves too unfamiliar; and their com¬ 
ments showed certain members of some panels would cast only it-votes 
on a majority of the items. 

Again very unlike those most obli^g judges encountered in text¬ 
book examples, these panel members declined to use dozens of ranks. 
The most they could be i)ersuaded to use was only three ranks (cor¬ 
responding perhaps to rough classifications of “very worthy,” “worthy” 
“less worthy”). 

As a practical illustration consider Table I in the first four blocks of 
which are shown (a) identifying numbers of voters, (b) their votes on 
two very closely competing items Pj* and Pas, (c) their numbers of 
votes on all items, and (d) rank nxunbers, and their averages, for items 
Pat and Pat, 


CBITEBIA WHEN k = Z 

In spite of its comparative neglect in textbooks this case in which 
no voter discriminates more than three ranks is sufficiently common to 
have developed an approximate or “1, 2, 3 method” for its treatment. 


^ For completeness one must mention also the possible, thons^ I think less promising methods, by 
which the order of items (a) is so selected as to minimize the number of ^^versions” of rank order in 
relation to ranks assigned by the various members’ votes or (b) is taken in accordance with net numbers 
of votes which rank each item superior to other items or (c) is based upon ^^Scores for Ranked Data,* 
Table XX, Fisher and Yates' "Statistical Tables for Biological, Agricultural, and Medical Research” 
(1943). 
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TABLE I 

ACTUAL DATA OP VOTES ON ITEMS Pa AND Pa OF A 4HTEM 
SERIES VOTED ON BY A 15-MEMBER PANEL 


Jf 

Votes oa 
Items 

Pa Pa 

a 

All Votes 

b 6 

It 

Rank No's. 

By (7,/n 

Pa Pa 

•Scores" 

Pa Pa 

Rank No’s. 
ByCiVn 

Pa Pu 

1 

e 

e 

16 

9 

16 


33.5 

33.5 

3 

3 

34.7 

34.7 

2 

e 

b 

11 

16 

13 

1 

35 

20 

3 

2 

34.7 

21 

3 

h 

b 

16 

16 

9 


24.5 

24.5 

2 

2 

21 

21 

4 

b 

b 

9 

22 

10 


20.5 

20.5 

2 

2 

21 

21 

5 

c 

It 

9 

13 

9 

10 

35.4 

21 

3 

— 

34.7 

— 

6 

h 

b 

22 

11 

8 


28 

28 

2 

2 

21 

21 

7 

b 

e 

22 

13 

6 


29 

38.5 

2 

3 

21 

34.7 

8 

b 

e 

11 

17 

13 


20 

35 

2 

3 

21 

34.7 

9 

c 

b 

15 

15 

11 


36 

23 

3 

2 

34.7 

21 

10 

e 

e 

13 

13 

15 


> 34 

34 

3 

3 

34.7 

34.7 

11 

a 

b 

17 

7 

17 


9 

21 

1 

2 

7.3 

21 

12 

c 

h 

15 

16 

10 


36.5 

23.5 

3 

2 

34.7 

21 

13 

e 

e 

15 

12 

14 


34.5 

34.5 

3 

3 

34.7 

34.7 

14 

b 

b 

18 

14 

9 


25.5 

25.5 

2 

2 

21 

21 

15 

b 

e 

24 

9 

8 


29 

37.5 

2 

3 

21 

34.7 





Means 


Means 


Means 





15.5 

13.5 

11.2 

.7 

28.7 

28.0 

2.40 

2.43 

26.4 

26.7 



Max 

24 

22 

17 

10 

±5.9 

±5.9 

±.63 

±.49 

±7.6 

±6.7 




9 

7 

6 

0 

(0Q.d.) 

(m.d.) 

(m.d.) 


Votes 

Item P» 1 7 7 0 
Item Pa 0 8 6 1 


SxampUi For judRO Mt 

Be *1^1(9+13+4(9+1)) *35.4 
««.*H41+1)«21 


With this the a-votes, 6-votes and c-votes are respectively assigned the 
arbitrary weights 1, 2, 3 and items are ranked according to weighted 
averages. 

When a/, 6y, cj are numbers of ct-votes, 6-votes, and c-votes respec¬ 
tively which item Py receives from all members, the criterion often used 
is 

ay -f 26y + 3cy 


+ 6y + Cy 
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This gives an approximation to the order specified by the Cj in that, if 
each voter places exactly one-third of the items in each category, the 
order among the numbers C/ is the same as that among the C,. In other 
cases the order may differ. 

The ordering of the items according to C/ has the dubious property 
of making a c-vote and an a-vote coimt always the same as two 6-votes 
for an item. Furthermore, this method makes a ^^•vote count the same 
as the average of all other votes on the mme item, whereas ordering 
by the C, makes a w-vote count the same as the average of the votes on 
all items. 

With moderately large numbers of voters these differences cancel out 
surprisingly well. For example, in the voting on 41 items by a 15-mem¬ 
ber panel cited above, there was only one pair of items whose relative 
rank differed by the two methods. These were the items P 28 and P 33 
for which the detailed vote is shown in Table I. Even in this single 
case the margin can doubtless be ascribed to chance. 

CONCLUSIONS 

The simple and rough ordering according to C/, carried out to the 
nearest tenth of a unit, and with differences of less than two-tenths of 
a unit between pairs of items disregarded, will probably meet most 
accuracy requirements encountered in practice, at least if the propor¬ 
tion of undecided votes is as low as in Table I. But the better method 
using Cj involves only a negligible amount of additional labor and, if 
only because of its extreme generality and flexibility, it is believed that 
it should be more widely understood. 

The degree of consistency in data of this sort may be examined in 
various ways, and probability calculations are essential in examining 
it. Interest in this aspect of the problem, and the appropriate methods, 
may vary widely according as the panel is intended (a) to furnish a 
sample of the opinion in some very large and not too select, population, 
or (b) to furnish a record only of its own collective judgments. 



LEVELS OF SIGNIFICANCE FOR VARIANCE RATIO OF TWO 
SAMPLES OF EQUAL SIZE 

C. J. Kiechbn* 

University of Michigan AeronaiUieal Research CerUer, YpsUanti, Michigan 

T his chajrt permits two-tailed probability evaluations of ratios of 
variances of two equal-sized samples at other than the conven¬ 
tional points of 6 and 1 per cent which are 10 and 2 per cent two-tailed 
levels, respectively. The chart was plotted from the tables of Thomp¬ 
son and Meriington (Biomeirika: 33, pp. 73-88), the 20 per cent table 
of H. W. Norton (Statistical Tables for Biological, Agricultural and 
Medical Research, by Fisher and Yates; Oliver and Boyd, Ltd., Edin¬ 
burgh and London), the .1 per cent table of Colcord and Doming 
(Sankhya, ii, pp. 423-424). The plotted points for 19 and 49 degrees of 
freedom were obtained by harmonic interpolation on ni, linear inter¬ 
polation on rii. 

The reader who is interested in acquainting himself with the mathe¬ 
matical basis for this chart is referred to Introduction to ModhemaUcal 
StaUstics by P. G. Hoel; John Wiley and Sons, Inc., New York; 1947; 
pp. 152-164. 

* Fonnerly with Bemingtou Axms Company, Ino, Bridgeport, Conn. 


Corrections to article published in Volume 43^ No, 243, June 1948 

ON ESTIMATING PRECISION OP MEASURING INSTRUMENTS AND 
PRODUCTS VARIABILITY 

Frank E. Grubbs 

Page 244, middle of page: Change Xi, xs, ••• Xi - x, to read 

^1) vthlj • • • Xif • ■ • Xn 

Page 245, second line from bottom: Change Xi+x in second term to 
read Xi—x 

Page 245, bottom line: change Ci to Cj. 

Page 249, line 8: Change xi to Xi and change x,i to Xi 

Page 263, tenth line from bottom of pi^e: Change cu-l-ei, to eu—Cu 

Pt^e 261, formula (30): Change e« to ejy 

Page 264, line 17: Change the word “variance" to read “variation” 
Page 264, foolmote, third line from bottom: Change “use” to read 
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The author wishes to acknowledge with thanks the permission granted by the Bemington Anns 
Go. to publish the attached chart, which was devised in connection with work in which the author was 
engaged with the company. 
























































MAIN EFFECTS AND INTERACTIONS 


D. J. Finney 
University of Oxford 

When the effects of two factors are simply additive, choice 
of estimates can be made entirely on grounds of minimal vari¬ 
ance. When interaction exists, estimation of the magnitudes 
of main effects and interactions requires an arbitrary definition 
of what constitutes a main effect. Whether the class frequen¬ 
cies be equal, proportionate, or disproportionate, the definition 
must conform to the use that is to be made of the results and 
must not be dependent merely on the convenience of an analy¬ 
sis of variance. 

T he analysis of variance is now so widely recognized as one of the 
most powerful techniques for the examination of statistical data 
that its users are sometimes in danger of regarding it as an end in itself 
instead of as a means to an end. In a well designed experiment, for ex¬ 
ample, the analysis of variance of the results and the summarizing 
of the effects of treatments are closely related and may even be merely 
different aspects of the same set of calculations; nevertheless, few 
things betray the inexperienced statistician more readily than a tri¬ 
umphant presentation of an elaborate analysis of variance table 
coupled with an almost complete neglect of any statement of treatment 
means. With fortuitous or observational data, in which symmetry 
may be less marked than in data from a planned experiment, analysis 
of variance is still a valuable method of studying different components 
of variation, but additional care is needed in order to ensure that the 
pattern of analysis adopted is appropriate to the data and to proceed 
satisfactorily from analysis to summary of results. 

In a recent letter to Nature^ Vajda [1] has drawn attention to a 
diflSlculty that occurs in the definition of a ‘^main effect” for an analysis 
of variance of data with unequal sub-class frequencies. Proper imder- 
standing of his point is important even with data having equal fre¬ 
quencies in the sub-classes, for only when the arbitrary nature of any 
definition of a main effect, at least if interactions are present, is appre¬ 
ciated can the analysis of variance and the summarizing of conclusions 
be seen in right perspective. Yates [2] has corrected certain miscon¬ 
ceptions that might arise from an incautious reading of Vajda's letter. 
A fuller statement, however, may be valuable to some whose knowledge 
of the analysis of variance is based more on text books of statistics than 
on their own experience. 
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The problem may be illustrated by reference to data on the interac¬ 
tion of two factors. Suppose that na observations are available on a 
variate x, at a level i of factor A and a levelj of factor B in combination, 
and that the mean yield (using “yield” as a general term for the variate 
under analysis) for this combination AJBj is x,,-. Suppose further that 
p levels of A and q levels of B have been examined; if certain combina¬ 
tions of levels are lacking, some of the na will be zero. The mean yields 
may be set out as a pXq table: 


X 

1 

Level of A 

2 3 • • 

• p 

1 

aJii 

X21 

arsi • • 

• Xpi 


Level of B 

2 

Xi2 

Xn 

Xz2 • • 

• Xp2 


3 

Xi3 

X2Z 

Xzz • • 

• XpZ 



Xlg 

X2q 

XZq • • 

• Xpq 


and the sum of squares of deviations from the general mean may be 
partitioned as follows: 

d.f. 

Between classes pq — \ 

Within classes S8 n,/ — pq 

i j 

Total SS n,j — 1 

1 j 

where S, S indicate summations over all values of the suffixes i, j, re- 

* J 

spoctively.* This much is easy: difficulties arise only when attempts 
arc made to examine particular components of the sum of squares be¬ 
tween classes, and especially those components usually described as 
“main effects.” 

The effect of factor A is said to be independent of factor B if the dif¬ 
ference in mean yield between any two levels of A is the same at each 
level of B, except for variations attributable to random sampling: 
(xij—Xi'j) is dependent upon i, i' but, apart from sampling variation, 
independent of j. That differences in mean yields between any two 


I The "total” and "between eLaaaea” degrees of freedom will be reduced by 1 for every zero value of 
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levels of B are siioilarly independent of the level of A necessarily fol¬ 
lows. The motn effbct of A is then naturally defibaed as the set of differ¬ 
ences between levels of A, and wUl be estimated by fitting a set of 
(p—1) constants for A simultaneously with a set of (g—1) constants 
for B, using the method of least squares (Yates [4]) and weighting 
each Xij with n^. 

In a planned experiment, nn will often be a constant, and the sub¬ 
division of the sum of squares between classes into main effects and 
interaction is then very simple. (Further components for “blocks” or 
other constraints may have to be removed from what has been de¬ 
scribed above as “within classes,” before a residual error mean square 
can be formed, but that does not affect the principles discussed here.) 
The required main effects are the differences between simple arithmetic 
means of columns and rows in the Xf,-table. When nt,- is not constant, in 
general the analysis is more complicated, but the particular case of 
proportional fre^piencies, mentioned by Yajda, can be treated simply. If 


flij 

where Ui is independent of j and is independent of i, differences be¬ 
tween column-means weighted according to frequencies, namely 

Xi. = S UijXijfS nij = S VjXij/S Vj 

j j j j 

are easily seen to be orthogonal with the similar set of contrasts based 
on row means 


x.y = S nijXis/S n,y = S UiXij/S «<. 

i i i i 

Furthermore, in virtue of the well-known property of a mean of ob¬ 
servations wei^ted inversely as their variance, differences between the 
Xi. estimate the main effect of A with lesser variance than would any 
other method of averaging the columns. Similarly the x.j give the most 
ef5.cient estimate of the main effect of B. It is therefore natural to re¬ 
gard the xt. and the x.y as providing the best estimates of the main 
effects from the data. The least squares procedure of fitting constants 
leads to exactly the same results. An analysis of variance, using the 
familiar formulae to give sums of squares for main effects, leaves a 
third constituent of the total sum of squares between classes, with 
(P~l)(ff~l) degrees of freedom, which is appropriate to a test of 
significance of the null hypothesis of independent action of A and B; 
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this is usually described as a s^ificance test ou the interaction, A.B. 

The difficulty to which Vajda draws attention arises only when the 
null hypothesis of independence has to be abandoned. The effects of 
the two factors are then no longer simply additive: the definition of the 
main effect of A, and in particular of the difference between mean 
3 delds for levels i and i', must therefore be re-examined. This difference 
now varies in some manner dependent upon the level of B, and, even 
though ny=Mi»y, considerations other than minimal variance may have 
to determine the estimated average value for the data. To use *<. as 
before is tempting, on account of its computational convenience and 
analytical elegance, for it still leads to the weighted average of quan¬ 
tities having minimal variance and it still gives a neat analy¬ 

sis of variance table. This is the estimation termed by Vajda that of 
“proportionate frequencies”; he appears to prefer to use mean values 
for levels of A and B defined by 

ii. = 8 Xij/q, &.j = 8 Xij/p, 
f 

the simple averages of class means, identical with the Xi. and x.j only 
if ny is the same for all i, j. This he speaks of as the “method of fitting 
constants,” but he uses the phrase in a sense rather different from 
Yates, who reserved it for the estimation of additive main effects when 
interactions were ignored. Vajda’s usage implies the “estimation” by 
least squares of (p?—1) parameters from the (pg—1) differences be¬ 
tween class means xy, so giving a perfect fit. 

Yates [5] points out that “Inasmuch as the frequencies actually 
obtained are likely to be similar to those generally occurring in the 
type of material under consideration, the definition of the main effects 
implied by the use of proportionate frequencies is usually more appro¬ 
priate in practice than that based on equal frequencies.” Whether in 
any one instance the definition is “more appropriate” depends upon the 
particular circumstances, and elegance of analysis alone must not be 
the criterion. Unless the Vj are proportional to frequencies for the q 
levels of B specially interesting to the e^erimenter (as when they are 
relative frequencies in a natural population of which the experimental 
material is intended to be representative), there is no good reason for 
preferring X{. to £{.. The point to be emphasized is that both are ar¬ 
bitrary: Xi. arbitrarily assigns equal importance to each of the q 
values of («<,—»{»,■), whereas Xi. arbitrarily weights these differences in 
proportion to «/. Conceivably, though rarely, the mean values of A 
required in summarizing the data would be defined by some extraneous 



570 


AMBBICAN BTATISTICAL ASSOCIATION JOtJBNAL DECEMBBB 1948 


S 3 rstem of weighting for levels of B, using wei^ts Vj* independent of i, 
but different from vj, to give 

*,• * = S Vj*Xij/8 Vj*. 

J 3 

These, and a similftr set of a:.y* for B, would estimate two sets of main 
effects mutually orthogonal and themselves defining the interaction as 
the residual set of (p—1)(3—1) contrasts. As a method of summarizing 
the results, this would be quite legitimate, though it would not corre¬ 
spond to a simple analysis of variance. Vajda rightly stresses that “the 
definition of an effect or of an interaction should not depend upon 
whether the frequencies are noticed to be such that a change in the 
definition leads to easier computations,” but mistakenly concludes that 
“the method of fitting constants should therefore always be accepted 
as the only legitimate one.” When the interaction is not negligible, 
estimation of the main effect of A must depend upon circumstances, and 
no definition, whether Xi., x,-., or the more general Xi*, can be claimed 
as “the only legitimate one.” The problem forces itself upon the atten¬ 
tion for the case of proportionate sub-class frequencies because alterna¬ 
tive procedures are then very evident. It still exists at the extremes of 
symmetry and asymmetry. The wise investigator will take account of 
this, when possible, by choosing his sub-class frequencies in accordance 
with the proportionate frequencies condition, so as to combine maximal 
precision of estimation with the weighting system appropriate to his 
interests. 

When all the are equal, as in the majority of planned experi¬ 
ments, the assumption that the simple average for each level of A over 
all levels of B gives the ideal estimate of the main effect of A is some¬ 
times made too readily; if there is an interaction between A and B, 
the interpretation of these averages needs to be most carefully consid¬ 
ered. A common situation is that in which the “levels” of factor B 
represent different methods of performing some operation. If the effect 
of A depends upon which method is chosen, no t 3 q)e of average A effect 
can be of much interest unless the q methods tested in the experiment 
either are the only possibilities or are in some way representative of all 
possible methods. An A effect averaged over an entirdy arbitrary selec¬ 
tion of levels of B may give a formal completeness to a summary table 
of results, but is then meaningless or even misleading: the results should 
instead be interpreted in terms of the effect of A for each Bj separately. 
For mcample, in a field experiment on a root crop, factor A might repre¬ 
sent amoimts of nitrogenous fertilizer and factor B different forms of 
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cultivation (inevitably a small selection only from all the methods of 
cultivation that might interest the experimenter). If analysis shows 
the hypothesis of simple additive effects to be untenable, a nitrogen 
“main effect” averaged over several cultivation treatments may be 
little use except as a broad qualitative expression of the results. Any 
detailed interpretation of the experiment will need to examine either 
responses to nitrogen under each system of cultivation separately or 
differential cultivation effects at each level of the fertilizer separately. 

When UijT^UtVj, no simple analysis of variance exists to tempt the 
experimenter to sacrifice meaning for elegance, but again he must not 
assume that the unweighted averages Xi, and x.j are necessarily what 
he requires. In the absence of interaction, the estimation problem is 
always simple in principle, and choice between alternatives can be 
based on their relative precisions. In the presence of interaction, the 
definitions of main effects and interactions are inextricably bound 
together; exact meaning can be given to interaction only as the de¬ 
parture from a specified system of main effects, and definition of the 
estimates of the latter must involve, explicitly or implicitly, arbitrary 
weighting of the effects of one factor at different levels of another. 

The above discussion has been restricted to two factors and their 
interaction, but the generalization to a greater number should be im¬ 
mediately apparent. This note is concerned only with the logical 
basis for speaking of main effects and interactions; the technique of 
estimating them from non-orthogonal data, and of fitting them into the 
framework of analysis of variance, has been described by Yates 
[3, 4] and, in great detail and with wealth of illustrative examples, by 
Snedecor [1]. 
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A TEST FOR SYMMETRY IN 
CONTINGENCY TABLES 


Albekt H. Bowkbb 
Stanford University 

S uppose that a sample of N individuals is classified according to two 
arguments, A and B, in an m by w contingency table and suppose 
that the categories of classification A are associated in some unique 
way with the categories of classification B. For example, in the study 
in which this problem arose the categories were related to the position 
of plates of stickleback (Gasterosteus) and the two arguments corre¬ 
sponded to right and left sides; it was logical to associate identical pat¬ 
terns on opposite sides. Denote by P*/ the probability that a randomly 
chosen individual belongs to the ith row and jth column of the table. 
As usual, a dot indicates summation over the replaced index. 

DISTRIBUTION OF POSSIBLE EVENTS 



Category 

1 

Glassification A 

m 

Marginal 

Distribution 


1 

Pa. 

Pit — — — 

Pim 

Pi. 


2 

Pa 

Pit 

Pnn 

Pi. 

Glassification B 

— 

— 

— 

— 



m 

Pmi 

P«. 

Pmm 

Pm. 

Marginal 

Distribution 


P.t 

P.t 

P.« 



It is desired to test the hypothesis that the above table is symmetric 
(i.e., Pij'—Pji). If this hypothesis is true, then Ph, — Pjk\ that is, the 
marginal distribution of individuals is the same for classifications A 
and B. Further, the conditional distribution with respect to A for the 
Jfcth category of B is equal to the conditional distribution with respect 
to B for the ifcth category of A. This equality of conditional distributions 
in general implies a stronger equivalence between the classifications 
than equality of marginal distributions, although for m=2 equality of 
marginal distributions implies complete symmetry. The weaker hypoth¬ 
esis of equality of maiginal distributions would also be of interest, 
especially in the absence of symmetry; this problem appears to be some¬ 
what more dij0Bicult to handle by straightforward methods. 
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The proposed test for the hypothec of symmetry is of the familiar 
X* type. Let n*/ be the observed frequency in the *th row andyth column 
of a table of observed quantities arranged as above. Then 


( 1 ) 


Z-E-T- 

*>i "T 


has for large N the Chi-square distribution with m(in—1)/2 degrees of 
freedom. A large value of x* leads to rejection of the hypothesis. Note 
that the value of x^ is computed from non-diagonal frequencies, which 
is reasonable since diagonal frequencies should not afford evidence tend¬ 
ing to disprove the hypothesis of sjmimetry. 

In cases in which the observed frequencies are small and no regroup¬ 
ing of frequencies is possible (e.g., in a 2X2 table) an exact test may be 
of interest. Specifically, in a 2X2 table an exact test may be made by 
testing with any tabic of the binomial distribution the hypothesis that 
ni* and n*i are observed “successes” and “failures” in a random sample 
of ni 2 -|-wsi events. From equation (1) we obtain for a 2X2 table the 
following x^ statistic with one degree of freedom: 


( 2 ) 


(«i2 - nti)^ 

»i2 + n2i 


This is equivalent to the value obtained by a test of goodness of fit of a 
binomial with P=J. Formula (2) was obtained by McNemar [1], who 
lists applications in psychological testing. A problem in the same field 
prompted a recent query in The ATnerican Statistician [2]. 

Several intuitive derivations of the above test may be given, but a 
rigorous justification may be obtained from a theorem of Gram4r [3], 
page 426, which puts the use of x^ tests in contingency tables on a sound 
theoretical basis. This procedure affords on interesting example of a 
routine application of the general theory and might well be an exercise 
for a course in the theory of x* tests. After verifyii^ certain regularity 
conditions, the method of Cram4r can be applied by computing the 
quantity 


(3) 


EE 


(mj - NPiiY 
NPii 


where the Pn are replaced by their maximum likelihood estimates un¬ 
der the hypothec of symmetry. These estimates turn out to be 
ni]-\-nji/2N which yield equation (1) on substitution into (3). 
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An alternate derivation is available if (1) is regarded as a sum of 
m {m—l)/2 terms of type (2). This approach requires the assumption 
that we are sampling from a population vdth fixed sums for all sym¬ 
metric pairs of cells. While both methods lead to the same result, the 
more general one based on Cram&*'s theorem seems slightly preferable. 
Until the test procedure is derived, it is not obvious that the use of the 
conditional distribution with fixed totals for symmetric pairs entails 
no loss of efficiency. 

The problem which prompted this note was suggested by Professor 
Frank Weymouth. 
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Effective wartime administration of the industrial system 
required the use of a common factual language with which 
government and industry could communicate. No such sys¬ 
tem existed in the prewar period. An effort to develop a suit¬ 
able quantitive language was begun early in the war, and met 
with some of the most diflSlcult problems encountered by war¬ 
time management. Although substantial progress was made, 
a stabilized and commonly understood system of communica¬ 
tion was never attained. This article discusses some of the 
outstanding problems in connection with questionnaire units 
of measure and terminology. An analysis of the development 
of material and product classification and terminology is fol¬ 
lowed by a discussion of the methods employed to standardize 
concepts relating to general terminology in questionnaire in¬ 
structions, to units of measure, to reporting time periods and 
to end-product usage. 

PRODUCT NOMENCLATXTRE AND CLASSIFICATION 

T he problem of developing uniform identification and classification 
of materials and products was more than a question of simple defi¬ 
nition. Standard definitions of both products and materials were avail¬ 
able. So also were classification systems. The problem lay in widely 
varying practices of industry relating to record-keeping methods and 
definitions, procurement patterns, and commonly used day-to-day ex¬ 
pressions. Into this traditional lack of uniformity were injected rapidly 
changing wartime influences, new products and materials, and substan¬ 
tial modifications and revisions of old products. These conditions lay at 
the base of the broad problem of unifying product and material con¬ 
cepts. 

Product names and other words used to identify them early in the 
war meant various things to different minds and by no means did they 
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always carry either the dictionary meaning or the technical one given 
to them in industry. Under the operation of the early preference rating 
system, products and materials for which preferred treatment was ex¬ 
tended were defined in broad terms, such as bombs, cranes, explosives, 
and tin. It was found very soon that manufacturers in reporting like 
products under this system used so many different names and so many 
varying classifications that identification for tabulation, allocation, and 
issuance of preference ratings became exceedingly difficult. 

As supply relative to demand tightened for an increasingly large 
number of products and materials, and as administrators became more 
familiar with their products and the problems of controlling them, a 
drive began for greater precision of definition. But getting more detail 
intensified the problem of nomenclature since the result was often a 
further entanglement in varying industry usage and variations of a 
given product in grade, size, weight, cost and other characteristics, 
such as horsepower in the case of certain machines, warmth-conserving 
qualities in the case of waterfowl feathers, or wear-resistance in the 
case of leather and textiles. 

Relatively little improvement was noted in more precisely defining 
products and materials given in the Priorities Critical List to which 
preferred treatment could be extended on rating certificates. The move 
for more precision really began with the introduction of blanket ratings 
authorized by P Orders in the spring of 1941. The earliest P Orders 
were couched in vague terms with respect both to the product to which 
preference ratings might be applied and to the materials and compo¬ 
nents to which ratings might be extended. Continuous and often volumi¬ 
nous successive amendments, however, brought greater and greater 
detail into terminology. 

Order P-2 illustrates this evolution. Dated March 26, 1941, this or¬ 
der related vaguely to ‘^material for the production of machine tools.*^ 
No further definition of machine tools was given. But the list of 11 ma¬ 
terials to which rating could be extended to facilitate machine tool pro¬ 
duction with “Machine Tools, consisting of power-di-iven metal work¬ 
ing machines, hydraulic and mechanical presses, and welders,” and 
continued with components such as “motors and other electric equip¬ 
ment,” “abrasives,” “machine parts and equipment,” etc. In the next 
month a series of amendments more precisely defined equipment, viz., 
“power-driven metal-working machines and testing equipment,” “drill 
presses,” “tomio and automatic screw machines,” “nibblers and cutoff 
machinery,” “magnetic chucks and grinding machines,” and so on. 
With each successive amendment the list of specific tools became longer 
and more precise. 



STATISTICS IN THE WPS 


577 


Reporting forms naturally followed the trends established by the 
orders to which they were attached. Early reporting forms provided 
blank spaces for manufacturers to report products coincident with the 
move for more precise identification. Later forms often preprinted on 
the form itself the exact products to be reported. This development, 
obviously, served not only to limit reporting to products specified but 
also facilitated uniformity of response and understanding. 

Limitation orders, first issued in mid-1941, generally described the 
products to which they applied in far more precise terms than early P 
Orders. This was due partly to experience in connection with P Orders 
and partly because the L Orders, as an instrument for limiting produc¬ 
tion of a specified item, had to be precise to insure compliance. In ad¬ 
dition, of course, the limiting of production necessitated considerable 
care because it moved deeply into the entire problem of industrial prof¬ 
its and conversion to war work. 

The definition of trucks and motor vdbicles set forth in L-1, issued in 
August 1941, did not change materially in successive amendments to 
the order or to other L Orders issued to apply to model types originally 
incorporated in L-1. L-26, which was the next to last L Order issued in 
1941, restricted the production of farm machinery, farm equipment, 
and parts. Products covered by the order were listed in great detail. 
The reporting instrument first used under the order, PD-284, preprinted 
each of the approximately 175 specific pieces of machinery and equip¬ 
ment on the stub of the form. This sort of pattern was followed by 
many L Orders and*related forms, such as L-7 for refrigerators and its 
forms PD-176, 177, and 417; L-17 for vacuum cleaners and its forms 
PD-170, 171 and 417; and L-21 for automatic phonographs and its re¬ 
lated forms PD-182 and PD-417. 

Experience of administration brought greater improvement in prod¬ 
uct definition. But among the many orders, regulations and question¬ 
naires, there existed great variation in definitions, a situation which 
although constantly improved lasted throughout the war period. At¬ 
tempts of the WPB Requirements Committee and the various WPB 
Divisional Requirements Committees to direct product production and 
distribution and to allocate individual products imder control were 
constantly impeded by these conditions. Adding and duplicating ma¬ 
chines, for mstance, as used in Service Equipment Division in early 
reports, included both electric and non-electric types. Sewing machines, 
on the other hand, were reported imder the headings: "electric,” 
“hand,” and “treadle.” Benzene as originally reported by the Chemi¬ 
cals Bureau meant both the pure pharmaceutical grade and the much 
lower motor grade. Hot water storage tanks defined by the Plumbing 
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and Heating Division included four products ranging in weight of con¬ 
tained carbon steel from 50 to 1700 pounds, while track laying tractors, 
on the other hand, were subdivided into five weight classes and two 
models. Domestic electric mechanical refrigerators were reported by 
the Consumers Durable Goods Division in terms of units, regardless of 
size or weight. Such lack of system was widespread and was justified 
by administrators as an essential compromise of administration. 

The need for better definitions of individual products coincided with 
the demand for establishing a usable classification to cover the entire 
spectrum of individual manufactured products. The problem of obtain¬ 
ing better definitions and classifications of individual products and the 
industries which manufactured them was not new to government stat¬ 
isticians. Government agencies such as the Bureau of the Census, the 
Bureau of Labor Statistics, and the Bureau of Foreign and Domestic 
Commerce had devoted considerable attention to the problems of prop¬ 
erly classifying industrial data. Each governmental agency, however, 
maintained its own classification system and a given product or estab¬ 
lishment was often classified so dMferently that data collected by one 
could not be related to that of the other. 

To grapple with this problem in the prewar years, a committee was 
formed in 1934 xmder the sponsorship of the Central Statistical Board. 
The work of this group resulted in the publication in 1937 of the 
‘^Standard Industrial Classification.^^ When this volume was com¬ 
pleted a “Standard Materials Classification” was begun but was not 
available in the early defense period when emergency requirements for 
standard materials classifications became important. By 1941, classifi¬ 
cation was a problem of considerable magnitude in the work of the 
0PM. In its efforts to create meaningful summaries of supply contract 
awards and the volume of preference rating certificates the “Standard 
Industrial Classification,” as modified in the 1939 Biennial Census of 
Manufacturers, was the classification system used. 

The groupings used in the Biennial Census of Manufacturers, how¬ 
ever, did not provide for an adequate coverage of all the poducts in¬ 
volved in wartime production. As a result, the Division of Statistical 
Standards of the Bureau of the Budget (successor to the Central Star- 
tistical Board), with the aid of the WPB, The Bureau of the Census 
and other federal agencies represented on the Federal Committee on 
Economic Statistics, imdertook a cooperative effort which resulted in 
the development of the “Defense Program Product Classification.” 


1 At the end of the article. 
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This system subdivided many of the broad categories previously used 
and added new classifications for war products and their complementary 
items. Although the “Defense Program Product Classification” pro¬ 
vided a foundation for the summarization of economic data, it was 
deemed necessary to group the data further to convey information to 
top executives. As a result, when the first PD-275 was issued in Janu¬ 
ary 1942, the established classifications were grouped into 111 new 
product groups covering manufacture in the metal-working industries. 
In addition, seven new groups were established covering such activities 
as shipyards, naval air stations, and proving groxmds. 

Following discontinuance of form PD-276 this classification system 
was expanded and modified to tabulate form PD-25A, used under the 
first comprehensive metal allocation system adopted by the WPB. 
Under the product classification system used for PD-25A it was pos¬ 
sible to classify production data with the specific product to which they 
related, to summarize data to reflect wartime industrial operations, and 
to provide summary information which the WPB could use for product 
administration. 

Despite the great improvement in product classification for tabu¬ 
lation purposes, considerable difficulty arose in the WPB in attempt¬ 
ing to use PD-25A data for administrative purposes. The core of the 
problem rested in the fact that the WPB internal assignment of prod¬ 
uct responsibility varied considerably from the product arrangements 
used in tabulation. In addition, WPB administrators felt that they 
needed more detail than it was possible to get from manufacturers’ 
PD-25A schedules. Both problems could have been solved, but for 
these and other reasons this classification was seldom used for day-to- 
day administrative purposes following the discontinuance of the PEP 
and its form PD-25A in the spring of 1943. For tabulating such com¬ 
prehensive general policy forms as WPB-732, CMP-7 and WPB-617, 
however, this system, continuously modified and improved, remained 
in use throughout the war period. 

With the introduction of the Controlled Materials Plan in the spring 
of 1943 a new product classification had to be devised to fit its opera¬ 
tional needs. Products were divided into two groups: Class A and Class 
B. Class B products originally were those in the component class but 
eventually included end-products as well. These were products the 
controlled material allotments for which were made by the WPB. All 
other products were Class A products and for the most part allotments 
were made to them through Claimant Agencies. Over and above the 
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Class A products were Claimant Agency pri^ams, such as tanks, de¬ 
stroyers, and submarines. Both the Class A and Class B specifications 
of products were designed essentially for administrative purposes with¬ 
out regard to the logic of product nomenclature. 

Material Nomenclature and Classification 

The evolution of material definitions and classification paralleled 
that for products. Efforts to develop adequate information about basic 
materials brought to a head confusion which had existed as far back 
as the early NDAC days. There was little consistency of ideas about 
what constituted a basic material or what shape and form of elements 
composed it. Each industry had its own definitions. So did early war 
administrators. 

Even a basic concept such as steel ingot was frequently misunder¬ 
stood, misapplied and misused. Economists and statisticians, for exam¬ 
ple, in talking about steel thought fundamentally in terms of ingot and 
spoke continually of the capacity of American steel producers as being 
60 million tons of ingot per year. In contrast, the steel industry men 
talked about finished steel, for example, plate, structural shapes, bars, 
sheets, etc., but knew that in their conversation tons of ingot meant 
75 per cent or less of comparable tons of finished steel shapes and forms. 
Various operating divisions of the early war agencies were staffed with 
steel consumers who visualized tons of steel as finished products. They 
spoke of steel in material tons which translated ingot into finished 
fabricated component or end products. In this sense an ingot, depend¬ 
ing upon the product, meant much less than 75 per cent of comparable 
basic ingot tons. Throughout the 1940-1941 peiiod there was continual 
confusion in discussions of potential supply and demand for steel as a 
result of these diverse concepts. 

The languE^e of basic materials was by no means fixed in definitions 
pertaining to broad categories of basic materials such as the distinction 
between carbon steel and alloy steel, between alloy steel and staioless 
steel, and among other grades of steel, non-ferrous metals, and non- 
ferrous minerals. Thus, for example, alloy steel, as the steelmaker 
thoi^ht of it, started at the point where the carbon content was 
rou^y less than .5 of 1 per cent and the manganese, silicon, copper or 
other alloy content was about 1.65 per cent. For many consumers, 
however, such low alloy figures were insufficient to meet their concepts 
of alloy steel. Stainless steel was distinguished from alloy steel by con¬ 
sumers but was generally considered alloy by steel makers. Actually, 
many people in the steel industry loosely defined as carbon steel a sub- 
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staixtial part of the types of steel which non-technicians regarded as 
alloy. The reason was that low alloy content, particularly manganese, 
had always been recognized by steel producers as higher grades of car¬ 
bon rather than alloy. Among steelmakers themselves the distinction 
between carbon and alloy steel was not always clear-cut because it de¬ 
pended on processing techniques as well as chemical content. In many 
cases, what was carbon to one miU was alloy to another. And peace¬ 
time concepts were modified to fit wartime specifications, further blur¬ 
ring the distinction in common usage. 

Even more confusing was the nomenclature of basic material finished 
shapes and forms. Once again, difficulty sprang from varying poptilar 
usage in industry and a lack of familiarity with established nomen¬ 
clature on the part of those whose experience in industry was limited. 
In the steel trade, for example, any finished mill shape more than 3/32 
of an inch in thickness and 47 inches or more wide was considered a 
plate and anything under these dimensions was a sheet. To a manu¬ 
facturer or a steel barrel or drum, however, all drum stock might be 
considered sheet whether or not it was over or under those limits. Some 
shipbuilders ordered “plates” when specifications carried a weight of 
over 25.5 lbs., per square foot. To some consumers, plates were all 
sizes over 7.65 lbs., per square foot and over 48' wide; to others, sizes 
over 10.2 lbs., per square foot; and so on. On other steel shapes also, 
such as rod and bar, structural shapes, pipe and tube; in copper shapes; 
in aluminum shapes; and in other materials; usage was far from uni¬ 
form. What was one man’s sheet was another man’s plate; what was 
one man’s rod was another man’s bar; what was one man’s pipe was 
another man’s tube. 

These differences were reconciled under the early M Orders by re¬ 
quiring each producer or consumer of the material to report informar 
tion in the terms specified on the form, irrespective of the manner in 
which records wore kept. Beyond a particular allocation system, how¬ 
ever, definitions and classification remmned confused. A manufacturer 
who had to report comparable information to several product adminis¬ 
trators found that each used different definitions and classifications. A 
producer of welding electrodes, for example, found that in reporting 
his consumption of alloying elements the definitions of tungsten, var 
nadiiun and molybdenum alloys changed witifi each person with whom 
he dealt. 

Such problems not only created difficulties for manufacturers but 
raised obstacles at many points in the administrative process. Product 
admimstrators found it difficult to unify their treatment among differ- 
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ent manufacturers using the same product, a situation which was re¬ 
flected not only in their administration but also in top policy. In re¬ 
ports of administrators to top staff, separate product administrators 
often came to startlingly different conclusions. It was often difficult 
for policymakers to know whether one report was more accurate than 
another, whether one report used poorer figures than another, or 
whether the conclusions of various reports were really identical but 
expressed differently because of differences in terminology. 

Complications increased as questionnaires for each material under 
allocation demanded greater detail. The tighter supply became in re¬ 
lation to demand the more detail administrators required. Under 
M-l-i, for example, aluminum producers were asked to report on Form 
PD-8 only their primary and secondary monthly shipments by nine 
preference rating categories. This form was promptly replaced by PD- 
26 upon which aluminum producers, refineries and smelters were asked 
to report on twenty specific aluminum shapes and forms. Through 
continuous expansion of reproting requirements aluminum producers 
were finally asked to report on CMP-24 a total of 43 shapes and forms. 
Eventually a listing of 77 aluminum shapes and forms was developed 
but the number for which reporting was required did not go beyond 
43. Comparable expansion in other basic materials such as steel, nickel, 
copper, magnesium, cork and rubber also took place. 

In September 1941, a movement began for the compilation of a 
standard materials list which would define precisely the meaning of 
basic materials, grades, shapes and forms and other species of the same 
genus. By this time the work of the Bureau of the Bu^et, noted above, 
had resulted in the publication of the Standard Materials Classification. 
A new materials list was developed from this volume and became 
known as Materials List #1. It ombraced all of the important metal 
shapes and forms and included in varying detail such important com¬ 
modities as textiles, lumber, miscellaneous minerals, and chemicals. 

When this list was published the Army made known that it also was 
preparing a standard materials list. When the Army and the 0PM lists 
were compared many inconsistencies were revealed. By means of com¬ 
promise, modifications, and adjustments, the official “Bed Book” or 
Offieial Classifieedion List of Raw and Basie Industrial Materials was 
issued in February 1942 as a joint publication of the ANMB, the WPB, 
the Navy, and the Maritime Commission. 

Crystallization of common usage by the publication of the “Bed 
Book” solved the basic problems of material terminology. Thereafter 
the battle was more one of improvement than of struggle with basic 
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concepts. Nevertheless, the use of material shape and form classificar 
tions on various reporting instruments varied •widely. Items on Mate¬ 
rials List No. 1 were used for the first time on form PD-25A. Through¬ 
out the year 1942 and in the early part of 1943, PD-25A continued to 
incorporate this classification. In the meantime, various questionnaires 
used in connection with M Orders, inventory controls and P Orders 
continued their expansion of material shape and form reporting re¬ 
quirements. Since the use of neither Materials List No. 1 or the “Red 
Book” classification was compulsory, material breakdowns on these 
questionnaires often varied considerably from the standard listings and 
PD-25A. 

The movement towards an expanding list of material shapes and 
forms for copper, aluminum and steel allocation controls was cut short 
with the introduction of the CMP. For allotment purposes the CMP 
materials listing was short. The 85 shapes of carbon and alloy steel 
used on PD-26A were reduced to two CMP groups; carbon steel and 
alloy steel. The 24 copper and copper-base alloy shapes and forms of 
the PRP were reduced to four under the CMP, and the 21 aluminum 
shapes were lumped into 8 shapes and forms. 

Other materials allocated by the PRP such as magnesium, zinc, 
nickel, and lead were not included under the CMP. But by the time 
the CMP was introduced their classification had become more or less 
standard. Condensation of shapes, and forms, such as took place under 
the CMP, did not occur •with most other materials under individual 
allocation programs. 

The decision to telescope basic material shapes and forms resulted 
from a combination of circumstances. To begin •with, an expanded list 
of shapes and forms such as that used under the PRP would have 
made CMP completely unworkable, particularly in light of the first 
proposal to issue allotments on a monthly basis, to collect detailed bills 
of materials for all products and programs, and to ticket each allotment 
•with a doemnent as it filtered throu^ industrial processes. Second, 
there was a natural revulsion from the lengthy listings included in PD- 
25A which became cumbersome both from the standpoint of manu¬ 
facturer response and WPB administration. Third, experience ■with 
previous metal-allocation systems had demonstrated that the WPB’s 
control, at least as far as authorizations were concerned, could be just 
as effective on the basis of broad material groupings as on that of de¬ 
tailed shape and form breakdowns. Individual controls could be used 
to resolve mill production and distribution problems for ti^t shapes 
and forms. 
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Not all the CMP operations were in terms of the telescoped groups 
noted above. In the original draft of the CMP there was established 
material shapes and forms under each main controlled material as fol¬ 
lows: carbon steel, 13; alloy steel, 10; copper, 9; and aluminum, 33. 
Some CMP and related forms required reporting in variations of these 
shapes. CMP-26 (steel producers’ daily reports of accepted authorized 
controlled material orders for carbon and alloy steel), for example, 
listed 46 carbon and 33 alloy steel shapes and forms for reporting pur¬ 
poses. CMP-22 (producers’ report of shipments of steel castings) re¬ 
quired data concerning 10 grades of castings. Aluminum suppliers re¬ 
ported shipments and unfilled orders on CMP-24 by 43 shapes and 
forms, and cumulative authorized orders accepted for delivery on CMP 
12 by 22 shapes and forms. For copper the same sort of situation ex¬ 
isted. On form WPB-3112, (brass mill application for copper, copper^ 
base alloy, ingot and scrap) reporting was in terms of 10 different types 
and on WPB-3508 (brass mill program report) the respondents re¬ 
ported in terms of 4 brass mill shapes. These reporting instruments 
were typical of a large number of forms pertaining to mill shipments, 
applications and requirements. 

There is considerable doubt whether telescoping of shapes and forms 
imder the CMP was completely desirable. While it is true that sum¬ 
marization of shapes for the three controlled materials eliminated a 
substantial amount of postii^, the absence of detailed shape and form 
information led to rules and regulations which might not have been 
necessary with more basic factual detail. Condensation under the 
CMP was partly based on the assumption that shape and form details 
would be available for products on prototype bills of materials. But 
few accurate bills of mateiials were ever received. No particular prob¬ 
lems wore created so long as fabricating facilities were balanced with 
the requirements of end pr<^ams and mill capacity was sufficient to 
satisfy the really essential demands. Under the operation of the CMP, 
however, there was a constant shifting of the degree of tightness among 
various material shapes and forms, in and out of detailed sizes, and in 
various shape qualities and characteristics. This experience necessi¬ 
tated the introduction of new methods to plug the various holes when 
they opened. The net gain, therefore, was not so great as might appear 
at first. 

General Terminology 

Precise meanings of many terms and phrases had to be clearly stated 
in each questionmure to assure uniform responses. The problem of 
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nomenclature, therefore, was more or less as broad as the complete 
lexicon of the spoken language. 

WPB questioimaires showed continual improvement from loose and 
ill-conceived instructions on many early questionnaires to quite clear- 
cut and precise instructions on later questionnaires, oftentimes rein¬ 
forced by reference to well-defined regulations and orders. But no mat¬ 
ter how carefully instructions were written, they were always subject 
to criticism, misinterpretation and misimderstanding. On the one hand 
were those who objected to the “legalistic” and precise terminology 
used, and on the other hand were those who deplored the looseness of 
phraseology. Some manufacturers objected to the insufficient use of 
technical and “shop language” whereas others complained of phrasing 
which was subject to misinterpretation. Our interest here is centered 
lai^ely in a recognition of such problems together with a few typical 
illustrations of points of difficulty, rather than in a definitive analysis 
of either the problem or of its solution. 

The term “capacity” was continually misunderstood on WPB re¬ 
ports. On WPB-3002 (manufacturers monthly report of imfiUed orders 
and schedule of shipments) and WPB-2578 (report on speed reduction 
units), for example, manufacturers were required to show in units their 
productive capacity and schedule of deliveries for each type and size of 
product. Producers of electrical testing equipment, resistors, electric 
motors and generators, speed reduction units and unmounted gears in¬ 
veighed against the term “capacity.” They insisted that since two or 
more models could be manufactured with the same facilities and the 
same wage earners, the productive capacity for any given size and type 
depended upon the facilities in use at any given time, and that one 
month’s operations could require heavier production of one model and 
lighter production of others, while in the next month the situation could 
be reversed. Even though instructions often stated that capacity by 
typos should follow the proportions established by past actual produc¬ 
tion and that all capacity data ought to be concurrent, some manu¬ 
facturers maintained that productive capacity for each model was 
something which did not exist in fact. Other manufactxirers alleged that 
the term “capacity” was too indefinite. They were not certain whether 
it meant theoretical capacity, practical operating capacity imder the 
existing conditions of material, labor and machine tool utilization, or 
actual capacity based on past performance. Although the definition of 
the term was apparently clear to those in the WPB responsible for its 
use (the term generally meant the maximum monthly rate of produc¬ 
tion of each item which could be reached and maintained with the 
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equipment available, assuming the availability of materials and the 
maintenance of maximum hours of operation) to industry the term 
lacked meaning. 

The phrase “delivery schedule” raised considerable confusion. WPB- 
1801 (production of electric motors and generators), for example, re¬ 
quested for each model whose production was reported the standard 
delivery time in weeks. Industry replied that there was no such thing 
as a “standard delivery time” for a particular product. To have signifi¬ 
cance to a manufacturer, the phrase “delivery schedule” had to relate 
to shipments, since a manufacturer scheduled his production against 
dates of shipment from his plant and not dates when delivery was made 
to a customer. Closely connected with these words was the term “re¬ 
quested for delivery.” WPB-3002 and WPB-2678 (noted above), for 
example, called for units of product or dollar value scheculed for ship¬ 
ment and requested for delivery by months. Instructions asked the 
respondents to specify the total dollar value or number of units in¬ 
cluded in the total unfilled orders broken down by the amounts that 
purchasers requested in each of the indicated time periods. Industry 
insisted, however, that it could not develop the data from existing 
records, nor could it see how the data could be developed. This resulted 
from industry’s interpretation of the data request to mean the amoxmts 
originally requested by customers before placing their orders, informa¬ 
tion which of course was unobtainable. The intention of the WPB was 
to compare the items scheduled for shipment with those requested for 
delivery so as to be in a position to determine the extent to which pro¬ 
ducers were f ailing behind schedule. 

These illustrations are no more than indicative of a problem which 
baffled both the WPB and industry throughout the war period. There 
were hundreds of other words and phrases which caused considerable 
difflculty, such as idle, excessive, obsolete, usable and nonusable in¬ 
ventory; net shipments; principal product; captive operations; main¬ 
tenance, repair, and operating supplies; repair parts; and so on. Ex¬ 
perience demonstrated that a considerable period of time was often 
required before words and phrases were so defined that they were 
mutually comprehensible by the WPB and industry. It also left little 
doubt that for best results terminology used in questionnaires had to 
conform as closely as possible to that generally used in industry and 
that the more precisely a term or phrase was defined the less opportu¬ 
nity existed for misinterpretation and consequent inconsistency in data 
reported by manufacturers and used by administrators. But what con¬ 
formed to industrial usage and what was precise were determined only 
by long and hard experience. 
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Industrial Reporting Units 

An important, but often overlooked question was that of the precise 
area to be covered by a single questionnaire. One of the basic common 
denominators determining the usefulness of collected information was 
the industrial unit for which reporting was to be made. The basic com¬ 
mon denominators in practically every industrial statistical report of 
the WPB were: (1) individual products (or materials), (2) individual 
industrial plants, and (3) companies. The structure of industry, and in 
turn the subject matter of production and distribution control, de¬ 
manded that this be so. 

The fundamental distinction between a product (or material) report 
and a plant report was simple. All questions in a product or material 
report related to specific products or materials; individual products or 
materials constituted the subject of the report. The plant report on the 
other hand used an industrial establishment in one geographical locar 
tion, or a subdivision of the plant, as the broad subject and required 
information about all products, all materials and other productive 
elements receiving attention within the plant area. As such, of course, 
a plant report was an aggregation of product or material reports but 
covered all products and materials as well as other relevant productive 
elements of a plant. In some plant reports all reported data about a 
plant related to individual products made by the plant or individual 
materials used in the plant. A company report was often a consolidation 
of either individual product or plant information, seldom the latter, 
on a company-wide basis irrespective of geographical location of individ¬ 
ual plants or the production locale of individual products or materials. 

These distinctions between plant and product reports were hazy in 
the earliest surveys. Fundamental problems confronting the 0PM in 
early 1941 concerned industrial capacity for certain products or ma¬ 
terials, inventories, and production and use of a limited number of indi¬ 
vidual materials and products in short supply. Early questionnaires 
sought facts about individual products and materials without a con¬ 
scious distinction between the plant and the product approaches to 
informational requirements. Reports had to be sent to plants or com¬ 
panies for completion, not to products. If the needed information re¬ 
lated to capacity or to inventories the report generally used a plant as 
the reporting unit, for data relating to such subjects were reportable 
and had meaning only in terms of a plant. If, on the other hand, the 
schedule concerned individual products or materials the result usually 
was a product report which confined questions to an individual product 
or material. Frequently, however, these reports (OPM-6 and OPM-23), 
for instance, combined questions as to capacity with data requests for 
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individual products or materials with the result that the distinction 
between a product and a plant became blurred and a questionnaire 
often in fact used both a product and a plant as a basic reporting meas¬ 
ure. But as the war progressed the distinction between the two report¬ 
ing bases became clearer. 

Reporting by industry in these fundamental units was seldom uni- 
fcam from survey to survey for the same questionnaire or different 
questionnaires. Thus, for example, specific data on one plant report, 
such as WPB-732, might include a universe slightly different from that 
reported on another plant report, such as PD-25A. Great variation 
existed from report to report because of flexibility permitted in report¬ 
ing and variation in the field of activity covered by a given set of in¬ 
dustry records. Even greater variations existed in factual reporting for 
specific products. 

Product and Material Units of Measure 

Closely allied to the names and classifications of products and mar 
terials were their units of measure. Here again the WPB found wide¬ 
spread differences in reporting of standard quantities and values. And 
here again the trend was towards greater precision in definition; and 
more general use of those standards which it was found were mutually 
understood by both industry and govenunent, readily reportable by 
industry and acceptable for administrative purposes. Measurement 
units generally were necessarily fixed accordii^ to industry usage and 
industry records, particularly when they related to past activities. Un¬ 
fortunately, however, usage was not idways standard among plants and 
industries and usage common in industry was not always of most 
utility to government. 

Material units of measure generally fell into two distinct groups, or 
else defied classification. The most widely used was a wei^t measure. 
Most basic materials, particularly the metals, originally requested 
reporting in terms of tons or pounds. Even these simple concepts cre¬ 
ated difficulties. Some questionnaires requested information in long 
tons, others in net tons, and still others in short tons, generally without 
defining the standard specifications pertaimng to the concept used. 
Some forms mixed the reporting requirements, such as PD-209A (re¬ 
port of vanadium consumption), and data were requested in both long 
and short tons with no definitions of either. The result was a 12 per cent 
variation in reporting. Experience led the WPB to request reporting for 
most materials, when tonnage was specified, in short tons of 2000 
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pounds each. Reporting in pounds presented no particular problem be¬ 
cause a poimd was solidly entrenched as a standard unit. 

For those materials which were incorporated in other basic mate¬ 
rials at the TTiill level, particularly the ferroalloys, reporting was often 
required in terms of the content of these other materials. PD-358 (pro¬ 
ducers application for molybdenum for melting), for example, requested 
the per cent of molybdenum in finished products. PD-359 (producers 
report of molybdeniun containing metal) requested reporting by pounds 
of molybdenum content. Complications developed from time to time 
in the use of such measures. Nitrogen-containing materials were at first 
measured in tons of ammonia content, then ammonia equivalent, and 
finally in tons of calculated nitrogen content. 

Units of measure outside these areas were too diverse for ready 
classification. PD-338 (inventory of jewel bearing material) measured 
jewel bearing materials, such as natural sapphire and synthetic ruby, in 
carats. PD-236 (consumption of jewel bearings) established reporting 
in terms of the nruuber of jewels contained ia bearings. Mercury was 
measured in flasks, phosphates in terms of equivalent phosphorus pent- 
oxide, ethyl alcohol in gallons of 190 proof, penicillin in billions of 
oxford units, and leather m terms of equivalent hides and skins. 

Three main types of units of measure were commonly used in con¬ 
nection with products. The first and most desirable unit for products, 
and the one most frequently used, was one unit of a multiple of the 
product in question. This was satisfactory when the product was uni¬ 
form, such as hot drink cups, telephones or kerosene raUroad lanterns, 
but could not well be used in the case of products presenting diversity 
in important characteristics. The imit classification was strained in 
numerous instances in sunomary tabulations, as in the case of steam 
and diesel locomotives, scales and balances and various machines. 
Second, many orders established controls on the basis of the tonnage of 
iron and steel or other metal used in the production of the controlled 
products during a base period and reports were made in terms of rmits, 
pounds, or value of the fiboished product. This material content ap¬ 
proach, of course, was limited to availability of records and was not as 
widely used as the other methods. Third, when the use of units and size 
or weight measures was unsatisfactory, dollar value measurement was 
used. This was inherently unsatisfactory as a control measure because 
of lack of information as to quantity and quality of production and dis¬ 
tribution, number of units, quantity of raw material and labor required, 
and changes in prices. It was used in lieu of anything better and at the 
same time was practical. 
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Confounding problems often arose because of the use of varying 
units of measure and the difficulties of settling upon a satisfactory con¬ 
version formula to secure comparability of related data. The Foreign 
Economic Administration, for example, received from the WPB allocar 
tions of leather and textiles in units of equivalent hides and linear and 
square yards and pounds of cloth or yam, but issued licenses and made 
requisitions in terms of shoes and finished textiles. Months were re¬ 
quired to develop conversion factors and standardize terminolc^ and 
measurement units. Comparable problems were encoimtered in many 
other products and materials. 

Difficulties with varying units of measure resulted in a trend in the 
WPB, continuing through the war years, to require reporting of ma¬ 
terial and products in terms of individual units such as quantity units 
for products and tons and pounds for metals. The coincident develop¬ 
ment of product and material definitions and classifications when re¬ 
lated to uniform units of measure made the data collected by the WPB 
increasingly more useful with each step forward in these areas. 

Tima Periods Covered on Reports 

Under the rapidly changing conditions of a war economy a year was 
too long a period to wait for reports upon which to base policy, and time 
periods moved from quarters, to months, to weeks, and often to days 
depending upon the urgency of a problem. From time to time products 
were allocated on a daily telephonic basis. Broad material allocation 
programs, however, could not operate on so short a time period. Many 
of the individual allocation systems in metals existing up to and through 
the Production Eequirements Plan operated on a monthly allocation 
basis. The PRP itself was based on quarterly reporting and allocation. 
This, it was felt was too long for many tight items, and the early pro¬ 
mulgation of the CMP envisioned monthly allocation. It was quickly 
perceived, however, that such a time period would involve a tremen¬ 
dous amount of paperwork, conferences and decision writing. As a con¬ 
sequence the time period instituted with the first operation of the plan 
was the calendar quarter. As circumstances permitted, other alloca¬ 
tion plans quickly followed the quarterly allotment procedure, and 
the calendar quarter finally evolved as the best time unit for making 
allocation, scheduling production and reporting on implementation. 

Measures and D^nitions of End-Use Data 

The general approaches to production control embodied in the pref¬ 
erence ratiiig system, P Orders, and M Orders, encountered a problem 
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relating to identification of end programs at sub-manufacturing levels 
which frustrated the WPB throughout the war period. Early operators 
of material allocation controls recognized that their decisions concern¬ 
ing mill shipment authorizations were not particularly intelligent in 
the absence of information relating to the uses, particularly the end- 
uses, to which the basic material ultimately was to be put. This type of 
information was difficult to acquire because no workable system existed 
for passing information from the top to the bottom of the industrial 
system. 

There were many different approaches to the development of end- 
use information. Throughout the year 1941 and into early 1942 con¬ 
stant difficulties with old methods resulted in their revision or the 
institution of completely new systems, each striving to obtain infor¬ 
mation about end uses which was considered vital to the operation of 
each allocation plan. By the spring of 1942, after almost one full year of 
experience, methods to derive end-use information were little closer 
to their goal than in the spring of 1941. 

The approaches current in the spring of 1942 may be classified into 
several rough categories. (1) The general approach was adopted in 
PD-69 (consumers pig iron order for shipment) which requested appli¬ 
cants to give a complete description of the ultimate use or product to 
be made from pig iron for which application was made. With little help, 
applicants were asked to be very specific, i.e., “Pump castii^ for 
Navy ships.” (2) PD-123 (producers report on copper and copper alloy 
deliveries) embodied what may be classified as a use approach. This 
form originally contained 37, and ultimately 100, classifications of end 
uses for which producers were to report deliveries of various copper and 
copper-alloy shapes and forms. On the basis of the classification ^ven 
it was not only impossible for producers to file completely, but one end 
product conceivably could appear in three or more classifications as set 
forth, depending upen its ultimate use. As a result, tabulations of this 
form were often weird and meaningless. (3) The ProdvcM/yfe approach 
may be illustrated by PD-180 (customer application for phenol). The 
applicant was obliged to enter the name of the product into which he was 
incorporating phenol and the specific ultimate use of that product. (4) 
The use and product approach was typified by PD-197 (producers re¬ 
port of lead and lead alloy shipments). On this form 33 classes of prod¬ 
ucts were preprinted in the stub, such as storage batteries, cable 
covering, and white lead. A cross-classification was requested showing 
the amounts of lead used in each of the preprinted products. 

The persistence with which this type of data was soi^ht and the 
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willingness with which the WPB introduced rapid changes in past meth¬ 
ods, each more complicated than its antecedent, testify to the impor¬ 
tance attached to this sort of information and the desperation with 
which it was sought. Neither the material procurement agencies nor 
the WPB commodity divisions were in a position to match demand- 
supply relationships on a factual basis. As a result it not only was im¬ 
possible to assign the resources of the country to those uses of most 
importance, but it was difficult to determine the desirability of expand¬ 
ing production of critical materials. The key to the puzzle seemed to lie 
in the development of machinery by which accurate identification of 
end products manufactured by prime contractors could be transmitted 
throu^out the entire industrial system, throu^ intermediate produc¬ 
tion and components, to the basic metal mill producers. Such a system, 
it was reasoned, would allow the WPB to follow the distribution of 
basic materials and components of each important end-product and 
assure its completion on a balanced schedule and in conformance with 
a carefully pre-arranged principle. 

To meet these objectives a bold experiment in end-use identification 
was prepared in June 1942 with the issuance of Priority Regulation 
No. 10. The regulation observed that its purpose was “(1) to obtain 
standardization and reduce the rules of different forms for allocation 
purposes that industry now must submit to the War Production Board, 
and (2) to furnish information needed by the War Production Bomd 
in the allocation of materials.... To allocate intelligently,’’ it went on, 
"it is necessary to know the subdivision of the war, industrial and 
civilian programs for which materials are going to be used. For example, 
it is essential to know whether the materials are goii^ to tanks, or de¬ 
stroyers, or raiboads, or to office machinery and supplies in order to 
determine allocation policy. In addition, it is necessary to know in a 
general way what type of purchasers will eventually receive particular 
products of industry, Le., whether the product will eventually be de¬ 
livered to the Army, Navy, etc.” 

The theory behind the code was simple. Each purchase order from 
a prime contractor would include an end-use code indicating the use to 
which the purchased materials or products were to be put. Each sub¬ 
contractor would then repeat the code symbol on his orders for mate¬ 
rials and parts until, in theory, the symbol appeared on orders placed 
for basic materials. Suppliers of critical materials would then be in a 
position to report their shipments by end use codes, which would re¬ 
veal information on both the end use and end user. It was recognized 
that this plan would not be feasible in every case, but it was thoi^t 
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that the band of unidentified end use could be narrowed and that the 
system would provide the first real check on the relationship between 
end products and scarce materials. 

The WPB materials and commodity Divisions violently protested 
the use of this system. Having developed end-use reporting require¬ 
ments to fit their particular needs, they insisted that the classifications 
in the end use code were so broad as to be almost meaningless. The 
purpose of this code was to standardize the many existing codes and 
to keep the number of classifications to a TniniTnnm , so as to reduce the 
burden on industry. Yet if the code did not meet the needs of the 
Divisions it was doomed to failure. To say this is not to say that the 
end use codes developed by the Divisions were workable. On Hie con¬ 
trary they lacked uniformity and could not be tabulated. But they did 
provide the narrow bands of end use which the Divisions wanted and 
which the newly devised code lacked. The theory behind the code may 
have been sound, but it fell down in its practical applicability. 

Serious criticism also came from many industries. The steel industry, 
for example, complained that it was faced with serious difficulties in 
the application of the regulation to its normal busiaess practices—so 
much so that the data received by the WPB from steel mills had little 
value for allocation purposes. Fundamental problems of the steel in¬ 
dustry lay in three areas. First, steel warehouses were required to 
analyze monthly all their purchase orders to determine the iffiocation 
classification pattern which they would use on their own purchase or¬ 
ders which they placed with steel suppliers in the following month. 
Since the warehouses received thousands of snuffi orders each month, 
the accounting and tabulating problem was staggering. Moreover, since 
the Steel Division of the WPB thoi^ht it could not and therefore would 
not allocate steel in accordance with the information furnished by the 
classification system, all the paper work involved was useless. Second, 
since the wardiouses added extra charges for cutting, handling, etc., 
analysis of the sales pattern by dollar volume as required by the regu¬ 
lation, was misleading. Third, since the Steel Division of the WPB re¬ 
quired information according to the nine piurchaser classifications used 
under its own Order M-21 instead of the five purchaser symbols used 
imder PR-10, the industry had to keep two sets of records. 

Following a short but turbulent life the system was abandoned in 
November 1942 as a failure. It was cumbersome in use, unworkable in 
large segments of industry, particularly at the component level, and 
produced unrddable order identification on mill order books. The con¬ 
cept was valuable but the methodology was poor. 
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Throu^out the experience with Priorities Regulation No. 10, it was 
becoming clear that satisfactory control over war production could not 
be answered by a suitable end-use system slone. It was but one ingre¬ 
dient in a far broader need which embraced the balancing of end pro¬ 
grams among themselves in relationship to the totality of scarce ma¬ 
terials; the scheduling of materials through the industrial system to 
end products, in terms of quantity and time; the adjudication of com¬ 
peting claims at critical levels of fabrication and direction of mill fabri¬ 
cating facilities. 

Following the failure of Priorities Regulation No. 10, the Production 
Requirements Plan provided part of this needed method. The introduc¬ 
tion of the Controlled Materials Plan in 1943 furnished a practical 
mechanism by which end program symbols could be transmitted down 
through the contractual chain to mill levels for controlled materials. 
The plan accomplished this simply by requiring that a controlled mar 
terial allotment and program symbol had to accompany each purchase 
order if the product ordered used any one of the controlled material 
shapes and forms. Not only were prime consumers requested to use 
major program symbols to identify orders carrying controlled material 
allotments but they also identified rated orders for class B products for 
which they did not reallot quantities of controlled materials. By means 
of this procedure end-program identification, based on CMP allot¬ 
ments, was provided for a very large part of American industry, from 
procurement agencies through to metal mills. 

The system by which Claimant Agency symbols were passed down 
to Class B producers was not designed to provide precise identification 
of either Claimant Agencies or programs. To begin with, many manu¬ 
facturers operating imder P, U, and other orders and regulations used 
a variety of spcciahzod symbols not identifiable in terms of Claimant 
Agencies. Second, rated orders from prime consumers did not always 
carry Claiment Agency symbols. Finally, extension to Class B pro¬ 
ducers by prime consumers was in terms of Claimant Agencies them¬ 
selves, and books were not maintained showing program symbols. The 
result was a broad specification of end-use on CMP-4B applications. 
Despite many difficulties in identification growing out of weaknesses in 
parts of the sjrstem, the Controlled Materials Flan solved practically, 
from an administrative point of view, the end-use problem for con¬ 
trolled materials. The solution, however, was not so much imbedded in 
a method of classification as in a complete new comprehensive allocar 
tion system. 
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Individual allocation systems continued to exist for materials and 
products other than those under general comprehensive programs such 
as CMP, Lumber, and Rubber Tires. The methods for determining end 
uses were compromises between the desires of the WPB and the records 
of industry. As might be expected the methods varied considerably 
and incorporated the same features which were used imder M Orders. 
As a consequence they engendered the same pre-CMP problems of com¬ 
pliance, the same type of industry criticism and the same collection of 
inadequate data. For many materials and products the problem was 
never solved satisfactorily. The discipline created by Priorities Regula¬ 
tion No. 10 and the operation of the CMP helped, but industrial prac¬ 
tices negated passage of complete end-use information to raw material 
sources without a superimposed method such as the CMP allotment 
chain. 

The length of time required by the WPB to develop satisfactory 
definitions and classifications should not be viewed too critically. Of 
course the struggle was much influenced by the fumbling which char¬ 
acterized the entire evolution of production and distribution controls. 
But the matted problems of definitions and classifications were, even 
beyond that, hard to disentangle. Even when definitions were clear cut 
new circumstances inevitably arose to blur them. In some fundamental 
aspects, the problem of nomenclature specification was insoluble. What 
was a sound definition depended upon the persons using it, industry 
usage, the state of classification systems and the precise needs of a 
given control policy in which it was a part. The broad definition of a 
material such as carbon steel, to cite one instance, was reasonably satis¬ 
factory in the early war period. But it really meant little. To speak of 
cai'bon steel sheet helped a little, but not much. Further clarification 
of carbon steel sheet in terms of a specific gauge, size and shape was 
better, but still not sufficient for complete clarity. To carry the needed 
extensions of definition to extremes, which would be necessary for com¬ 
plete and unequivocablc understanding, obviously would have entailed 
the most minute description of characteristics of a product or material. 
Even this would not be enough. In a war period, rapid changes take 
place in old products and there is a constant introduction of new prod¬ 
ucts with new names, characteristics and uses. In such circumstances 
workable definitions and classifications are all that can be expected. 
Much time and experience, however, are required before this condition 
can be reached. 
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1 Standard Indttstrial Classifieationt Central Statistical Board (through 1937); Bureau of the Budget 
(1940 to present). Fart 1, Manufacturing Industries, December 10, 1937. (Tentative draft.) 

Volume I, Part I, Manufacturing Industries, List of Industries, May 10,1938. 

Volume I, Part I, Manufacturing Industries, lost of Industries, 1939. 

Volume I, Part III, Manufacturing Industries, Alphabetic Index of Products, Establishments and 
lucernes, 1939. 

Volume I, Part IV, Manufacturing Industries, Alphabetic Index of Products, Establishments and 
Processes by Major Industry Groups, 1939. 

Volume II, Port I, Non-manufacturing Industries, List of Industries, 1939. 

Volume II, Part II, Non-manufacturing Industries, Description of Industries. 1940. 

Volume II, Part III, Non-manufacturing Industries, Alphabetic Index of Products, Establish¬ 
ments and Services, 1940. 

Volume II, Part IV, Non-manufacturing Industries. Alphabetic Index of Products, Establidiments 
and Services by Major Industry Groupo, 1940. 

Volume I, Fart II, Manufacturing Industries, Description of Industries, 1940. 

Volume I, Manufacturing Industries, 1941. 

Volume II, Non-manufacturing Industries, 1942 (still current). 

Volume I, Part I, Manufacturing Industries, Titles and Descriptions of Industries, November 
1945. 

Volume 1, Part II, Manufacturing Industries, Alphabetic Index, December 1945. 

Volume II, Non-manufacturing Industries, 1942, available at GPO for 25 fi. 

Volume I, Part 1, November 1945, available at GPO at 75^. 

Volume I, Part II, December 1945, available at GPO at SI. 00. 

Standard ClaasiJUd LUt of Commodittest Prepared under auspices Executive Office of the President, 
Bureau of the Budget; War Production Board; and Department of the Treasury, Division of Procure¬ 
ment. First edition (for comment only), July 1942. Reg. First edition, December 1942. Revised edition 
(Vol. 1), May 1943. Vol. II of Alphabetic Index, February 1944. Supplement to Vol. II, November 1945. 
July and December 1942 reports not available except in the files of the Bureau of the Budget. Other re¬ 
ports available at XT. S. Government Printing Office (May and Feb. 1943 reports @ S1.25, and November 
report ® S2.00.) 
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Review bt Alan E. Trbloar 
Professor of Biostatistics, University of Minnesota 
Minneapolis H, Minnesota 

S HORTAGE of qualified personnel within the statistical profession has been 
a matter of concern to employers demanding services from that source. 
The reports reviewed herein clearly testify that those charged with leader¬ 
ship in recruiting and training such personnel not only share that concern, 
but arc acutely aware of grave problems basic to the unbalance which, if not 
removed, will lead to continued deterioration as well as wastage of educa¬ 
tional opportunity in a very broad sense. 

Appreciation of the usefulness of statistical methods was limited to a few 
farsighted pioneers at the beginning of this century. Response to their 
demonstrations of new power in analysis was very slow in developing. Biolo¬ 
gists and workers in the social sciences generally were not able to assimilate 
the new biometry being created for them; for the most part they not only 
lacked the mathematical training so necessary to appreciation of its ap¬ 
proach, but were most reluctant to believe that there was any hope that 
the variations which beset them were amenable to mathematical treatment. 
Only a few of broader vision saw great hope in the new tools and applied 
themselves to work on the frontier with tremendous enthusiasm despite their 
poverty in mathematics. With indefatigable energy they set an example in 
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both teaching and research that gradually leavened the main lump. Their 
teaching was eminently practical. It had to be, for mathematical abstraction 
quickly bereft them of followers. Students learned by analogy how to apply 
the methods rather than to comprehend fully the basic principles of a new 
philosophy and the limitations imposed by the simplifying assumptions made 
by the theorists to reach the first solutions. These students have in turn be¬ 
come teachers, carrying responsibilities for which they were equipped only 
in a very limited sense. 

In reviewing the present situation it is well not to lose sight of the contri¬ 
butions made by these pioneers with all their handicaps. Statistics enjoys a 
large demand today and is what it is largely because of them. Theoretical 
statisticians who now strive to serve the sciences and assume leadership re¬ 
sponsibilities for the future development of statistics should properly temper 
criticism, no matter how justified it may seem from specific points of view. 

The reports being reviewed herein, differing sharply in their emphasis on 
criticism, are in close agreement when considering what should be done in 
the future. Both reports were preceded by another* from the Committee 
on Applied Mathematical Statistics (L. P. Eisenhart, S. S. Wilks, C. I. 
Bliss, E. TJ. Condon, H. 0. Gulliksen, L. J. Reed, C. P. Roos, W. A. Shew- 
hart, H. M. Smallwood, F. F. Stephan) of the National Research Council, 
which urged the same objectives. All three should be considered together. 
The American point of view is thus more adequately represented, both in 
numbers and in balance of practical and theoretical components. The Royal 
Statistical Society report is based on the deliberations of many persons rep¬ 
resenting a wide coverage of both theoretical and applied statistics. 

One need not review here evidence submitted of interest in and need for 
statistical methodology. A brief summary of the proposals for meeting the 
need would be in order, however. An introductory course of an orientation 
type is recommended for all college students as an essential component in a 
truly liberal education. Probability has replaced certainty in the modern 
way of looking at things, and those not familiar with the concept of aggre¬ 
gates and the fundamentals of the statistical argument in relation to it face 
a difficult mental reorientation in their efforts to keep abreast of life’s affairs. 
The committees look forward to the day when this orientation will be a 
regular part of the secondary school education, but for the immediate future 
provision must be made for it to be given in the college curriculum. It is 
judged that the course would most profitably be built around experimental 
demonstration; it should not require special mathematical ability. 

A basic course in statistical reasoning and methodology covering princi¬ 
ples common to all applications and given centrally for all students needing 
more than just a general orientation is strongly urged. The present system 


* Personnel and Training Problems Created by the Recent Growth of Applied Statistics in the United 
States, Repxint and Circular Series Number 128. Washingtonp D. C.: National Research Council, May 
1947. Pp. V, 17. Paper. $0.25. Reviewed in this Joubnal (Deo. 1947), by George W. Snedeeor and Hol¬ 
brook Working. 
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of many departmental courses of all qualities is judged to be badly in need 
of overhaul; applications departments should yield the common ground, 
concentrating on facets peculiar to their own special fields, thus fostering 
interest and competence. It is acknowledged that the general course would 
have to be given for some time with two levels of mathematical prerequisites, 
but these would ultimately fuse with calculus as a prerequisite for all. The 
two courses given meantime should correspond in every way possible and 
have the same coverage. 

Departments of application should continue to give statistical courses 
reconstituted to accord with the plan stated in the preceding paragraph? 
These courses might be regarded as intermediate in grade. They should 
embrace discussions of sources of data, limitations peculiar to them, and 
applications of methods with special emphasis on practical considerations. 
Attention would be given here to procedures of interest to the field of appli¬ 
cation but not of suflGicient general importance to be included in the central 
introductory course. 

Offerings at the advanced level would be planned in accord with facilities 
made available to meet existing needs. The IlMS report attaches great impor¬ 
tance to the training of future teachers through the route of advanced 
courses in statistical theory demanding advanced mathematics as a prerequi¬ 
site. While all reports emphasize the importance of experience in applica¬ 
tion as an essential concomittant in statistical training at all levels, there 
is a note of warning in one (IMS) against allowing the specialist in statistical 
method to participate too extensively in applications lest they "so engross 
his attention that he cannot keep up with the progress of statistical method 
itself.” History seems to this reviewer to indicate a drift toward theory by 
those of considerable competency in both theory and applications. 

Centralized teaching of statistical method within any college, as envisioned 
by these committees, calls for creation of a special teaching unit whose per¬ 
sonnel would be free to study statistics for its own sake. "In statistics, as in 
other subjects, a profession of general practitioners requires the support of an 
inner circle of specialists, research workers and teachers; and the inner circle 
maintains its strength partly through teaching and serving the general pro¬ 
fession. , . . Where this advanced teaching can be associated with an impor¬ 
tant research school of statistics, considerable advantage will follow” (RSS). 
An independent department or institute of statistics is favored by all three 
reports, but where attachment to an existing department is insisted upon 
administratively the RSS committee recommends affiliation with mathe¬ 
matics. It is preferred that this unit have its own budget and be equipped 
as completely as possible with all facilities for practical work and research, 
including easy access to a fully adequate library of original sources and 
mathematical tables. This unit and the statistical units within applications 
departments should be drawn into close association through a coordinating 
committee, each gaming strength through reciprocal aid. Such a committee 
drawm solely from applications departments must not be accepted as a sub- 
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stitute for an administratively independent unit devoted primarily to teach¬ 
ing and research in statistical methodology. 

The RSS report makes no comment on current standards of teaching in 
statistics. The NEC committee acknowledges as undesirable the wide vari¬ 
ation in quality of statistics courses offered by applications departments 
within the colleges of this country, and the vested interests which the de¬ 
partments have developed in those courses, but goes no further. Only tho 
IMS Committee has the courage (if courage it be) to attack forthrightly the 
procedures of appointment and administration of statistical personnel 
within the applications departments. An olive branch is offered in statements 
of the form: “All this does not mean that any considerable number of people 
teaching statistics are unworthy or objectionable members of the academic 
community. Many, indeed, are of superior intellect, upright character, per¬ 
sonal charm, and undoubted teaching ability. Some are making creative 
contributions to other subjects. The only trouble is that they are teaching a 
subject in which they are not specialists, and which progresses so fast that 
only specialists can keep up with it.” The dove of peace, laboring somewhat 
in its flight, makes no call at the offices of the responsible administrators. The 
argument is plausible, skillfully phrased, and reading it provides some enter¬ 
tainment. But this reviewer doubts that it will add to attainment of the ob¬ 
jectives with which the report is so much concerned. A supply of adequately 
trained personnel in the sense defined has not been available. Theoreticians 
were very slow in responding to the need they are now so anxious to fill. 
They will be welcomed with unquestioning enthusiasm just as soon as they 
demonstrate that they can do the job more effectively than their predeces¬ 
sors. The affinity of theoretical statistics and mathematics is unquestioned. 
The report admits that mathematicians have so behaved in their teaching 
as to make children “resolve to quit mathematics as soon as possible.” 
Nothing surely would be more welcome than clear demonstration that such 
performances will have no place in the techniques adopted by theoretical 
statisticians in undertaking the heavy responsibility of teaching statistics 
on a college wide scale. 

To say these reports are important is to belabor the obvious. They con¬ 
stitute a most commendable attempt to plan for a better future. All teachers 
of statistics should read them carefully and reflect constructively on the 
problems they attack. 
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Investment, Location, and Size of Plant: A Realistic Inquiry Into the Structure 
of British and American Industries. P. Sargant Florence (Professor of Commerce); 
assisted by W. Baldamm (University Research Fellow). (University of Bir¬ 
mingham, Birmingham, England.) National Institute of Economic and Social 
Research, Economic and Social Studies VII. London N.W.l: Cambridge Uni¬ 
versity Press (Bentley House, 200 Euston Road), 1948. Pp. adii, 211. 18s. (New 
York 11: Macmillan Co. [60 Fifth Ave.] $5.00.) 

Review by Edgar M. Hoover 
Staff Economist, Council of Economic Advisers 
Executive Office of the President, Washington $5, D. C. 

P ROFESSOR Florence reports here on an anal3rsis of three interrelated at¬ 
tributes of industries: prevailing size of establishment, degree of geo¬ 
graphic concentration, and invested capital per man. The basic data are 
primarily British and American production censuses of the latest available 
prewar years. He develops measures for each of the three attributes, seeks 
to explain the observed associations among them, and evaluates the signifi¬ 
cance of the findings for purposes of public policy. 

The book as a whole represents a notable advancement of the state of our 
knowledge of industrial structure. Its value is enhanced by the presentation 
of industry-by-industry tabulations of nearly all the statistics discussed. 

At times readability and directness appear sacrificed to wealth of sugges¬ 
tive detail. For example, some pages of text and one whole appendix are de¬ 
voted to measures of the ‘‘heaviness’' of industries, though the concept of 
heaviness is not used anywhere else so far as this reviewer could discover. 
On the whole, the reader will be grateful for the wealth of suggestive loose 
ends tagged for further exploration. 

Size of plant is measured (after a discussion of alternatives) by number of 
employees. The extreme skewness of the size distribution of plants in an 
industry is circumvented by using frequency tabulations in which each 
class limit is 2 or 2| times the next lower one. All the author claims in behalf 
of this inelegant procedure is that it seems to work: a distinct modal or “prev¬ 
alent” size class appears in most industries except where wide variety of sizes 
is to be expected on a priori grounds. 

The aspect of location with which this book deals is the deviation of the 
geographic distribution of a given industry from that of industry as a whole. 
The “coefficient of localization” has already gained fairly general acceptance 
as a result of its previous use by Florence and others, but it is here tabulated 
for the first time for a nearly complete list of industries of both countries. 

The “intensity of investment” of an industry poses the most serious prob¬ 
lems of measurement. Horsepower capacity per worker is chosen as the 
least objectionable of several possible indicators. 

When specific British industries are compared with their American 
counterparts, a striking degree of international correspondence appears in 
plant size, location pattern, and use of power. This leads the author to his 
basic theses: that the basic technical operations of industries (and not merely 
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the national political, social and financial milieu) are significant in accounting 
for interindustry differences, and that size, location, and use of power are 
interrelated characteristics. 

In analyzing the relation between an industry's typical size of plant and 
its coefficient of localization, Florence uses simple contingency tables. The 
observed relation of plant size to localization is however, as earlier studies 
by Florence had indicated, a complex one. It goes considerably beyond the 
range of statistical proof as yet. 

Plants in many of the most scattered or “ubiquitous” industries show no 
definite central tendency in size. In areas of dense concentrated market or 
material supply, plants can be larger than elsewhere, so a diversity of plant 
sizes is naturally found within a single industry. 

In industries with intermediate degrees of localization, greater localization 
is associated with larger plant size. 

The most highly localized industries are characterized by medium-sized 
plants because (a) in these industries transport costs do not limit size of 
plants, so plants need not be small to be close to scattered suppliers or mar¬ 
kets; and (&) close juxtaposition of plants makes it possible to secure many 
of the economies of large-scale production by virtue of the size of the 
production center rather than of the individual plant—^plants therefore can 
specialize more narrowly, and do not have to be large to be efficient. This 
hypothesis as to highly localized industries is tested by a more detailed 
analysis of concentrations of metal trades in Birmingham and textiles in 
Lancashire. 

In exploring the relationship between size of plant and horsepower ca¬ 
pacity per worker, Florence uses the same technique of contingency tables. 
In general, larger plants are associated with greater use of power per man, 
greater financial requirements, and greater prevalence of multi-plant firms. 

The last part of the book speculates on implications of a continued 
trend toward larger plants and larger firms. There is little novelty in the 
author’s case that protection of the interests of labor and the consumer will re¬ 
quire a better and broader science of industrial management along with 
watchful public scrutiny or control of monopolistic industries.The most inter¬ 
esting point in the discussion of policy problems is an explanation of the 
difficulty of economically encouraging small business and dispersion of in¬ 
dustry to small nonsuburban communities. According to the relationships 
shown by Florence’s statistical analysis, most industries that are not tied 
by high transport costs to local markets or sources of materials are either 
highly clustered or characterized by large plants. Thus the promotion of 
small decentralized business faces the problem of increasing the efficiency 
of small plants or breaking up or transplanting complex localized groupings 
of plants. There may be further conflict between the aims of decentraliza¬ 
tion in nonsuburban small communities and local diversification of pro¬ 
duction. 

These barriers are not necessarily impassible. The observed associations 
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of size, location, and investment on which Florence bases his case are suffi¬ 
ciently loose to provide many exceptional industries and plants, and the 
economic advantages of size and localization are not immutable. Nonethe¬ 
less the basic difficulties remain and this book has done a useful service by 
bringing them to light. 


Foundations of Economic Analysis. Paul Anthony Samuelson (Professor of 
Economics, Massachusetts Institute of Technology, Cambridge, Mass.}. Har¬ 
vard Economic Studies, Vol. 80. Cambridge 38, Mass.: Harvard University 
Press (38 Quincy Street), 1947. Pp. xii, 447. $7.50. (London E.C.4: Oxford Uni¬ 
versity Press [Amen House, Warwick Square], 1948. 42s.) 

Review bt George J. Stigler 
Professor of Economics^ Faculty of Political Science 
Columbia University, New York 

P ROFESSOR Samuelson’s distinguished doctoral dissertation was largely 
completed about a decade before its publication (p. vii). This delay in ap¬ 
pearance is doubly unfortunate: unfortunate for other economists, deprived 
of portions of its brilliant theoretical analysis and criticism not hitherto pub¬ 
lished; unfortunate for Samuelson, much of whose subject matter is cur¬ 
rently of much diminished interest. But the problems he deals with have been 
and will continue to be important and his treatment of them should be read 
by all mathematically trained economists interested in formal theory. 

The leitmotiv of Samuelson^s treatise is the search for meaningful theorems, 
that is, theorems which are in principle capable of empirical contradiction. 
The statement that a person seeks to maximize utility is (in many versions) a 
tautology: it is impossible to conceive of an observational phenomenon that 
contradicts it. The statement that demand curves have negative slopes on 
the other hand, is capable of empirical contradiction, and hence meaningful. 
There is no question but that this criterion is an excellent scientific sieve, 
whereby tautological and observationally undefined theorems (with which 
economics is well supplied) may be separated out from the theorems with 
content. 

But it is not true that theorems about unobservable things are necessarily 
meaningless in a social science. On introspective grounds—and this is a per¬ 
fectly valid and indeed indispensable source of information—one believes 
that there is such a thing as utility, and that men strive for it. One can de- 
deducc additional conclusions (of a welfare nature) from this information, 
that cannot be reached by a strict positivist. Thus, we know and find it 
significant that, with given prices and money income, a consumer feels better 
off making his own choices than having someone else make them for him. 
The whole purpose of utility analysis in economics is to permit such welfare 
propositions, and such propositions are meaningful even though based upon 
introspection and communication. 
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Two different, but related, methods are employed by Samuelson to derive 
meaningful theorems. The first method is by use of the secondary (or suflBi- 
ciency) conditions for an extremum; we may illustrate it by a very simple 
example. Let a competitive firm be operating at a maximum profit output 
where marginal cost equals price; the secondary condition is that marginal 
cost is increasing at this output. Let the price now rise: if the theory is correct, 
the output of the firm will rise or at least not fall (even if the marginal cost 
curve is vertical)—Whence if a decline in output follows a rise in price, the 
theory is contradicted, 

Samuelson gives a variety of illustrations of this approach (Chaps. 2-3) 
and then proceeds to more detailed studies of the output of the individual 
firm (Chap. 4) and the demand of the individual consumer (especially Chap. 
5; special mention should also be made of the excellent discussion of index 
numbers in Chap. 6). Many of the theorems are already familiar, through 
the work of Slutsky, Hicks and Allen, Stackelberg, Mosak, etc., but nowhere 
else can one find so comprehensive a treatment. 

There are only two points I would raise in connection with this first method 
of deriving meaningful theorems. The first is: why should not theorems on 
groups (industries, for example) also be derived? It is implied (p. 5) that this 
is impossible because an industry equilibrium does not represent a maximum 
podtion, but this is surely mistaken: the particular entrepreneurs are in the 
industry because it is the most profitable place to be. 

My second comment is connected with the question: what if the theorem 
is contradicted by observation? Samuelson says it would not matter much 
in the case of utility theory (p. 117); I would say that it would not make the 
slightest difference. For there is a free variable in his system: the tastes of 
consumers. These tastes are not observationally (or operationally) defined, 
so any contradiction of a theorem derived from utility theory can always be 
attributed to a change of tastes, rather than to an error in the postulates or 
logic of the theory. The situation is less acute in the theory of the firm, for 
the corresponding variable (technology) is not observationally so free. But 
Samuelson's (and everyone’s else) utility theory is still tautological as an 
explanation of market behavior. 

Samuelson’s second method of deriving meaningful theorems is to draw 
the consequences of dynamic stability. For example, we may assume that 
price will rise (fall) if the quantity demanded at the present price exceeds 
(is less than) the quantity supplied, and that the greater the rate of change of 
price, the greater the discrepancy between quantity demanded and quantity 
supplied (p. 263). Then if, after an initial disturbance, the price is to ap* 
proach an equilibrium value, the demand curve must cut the supply curve 
from above. A miscellany of examples of this technique are given (Chap. 9), 
the most interesting being those for a Amplified Keynesian system. 

Thereafter Samuelson enters on a roving discussion of dynamic theory. 
Most discussions of economic dynamics, I feel, would better be entitled, 
"What I Know about Differential and Difference Equations”; with Samuel- 
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son’s mathematical erudition, no doubt his version is, "What the Reader 
Should Enow of Differential and Difference Equations." The economics, as 
usual, is appallingly absent. Some of the infinities of mathematical possi¬ 
bilities are discussed, but only in the most formal terms; there is no instance 
of the enlargement of our knowledge of economic processes in our society. 
Samuelson may reply that he is only providing tools, but who can know what 
tools we need unless he knows the material on which they will be used? 

Although it is not an integral part of the search for meaningful theorems, 
welfare economics of the Hotelling variety (i.e., somewhat free of value judg¬ 
ments) also receives considerable attention (Chap. 8). I persist in my belief 
that this is one of the less fertile areas that modem economists till; it is symp¬ 
tomatic that we have elaborate instructions on how to form welfare judg¬ 
ments that do not depend on value judgments, but we have no illustration of 
the application of this technique to a real problem of contemporary policy. 
Samuelson indeed offers much support for this skeptical view, by this enumer¬ 
ation of the assumptions of the new welfare economics (pp. 222ff.), most of 
which are held to be partly invalid. But he fails to examine other facets of 
the problem, one of which, I think, is especially significant. When economists 
are writing freely on desirable policy, that is, when they are not writing on 
methodology, the disputes are almost always over how the economic system 
works, and not over the goals that should be sought. Thus, although I dis¬ 
agree with some of the policy recommendations in Samuelson’s recent text^ 
book, I do not differ (except in emphasis) from the goals that they are in¬ 
tended to subserve. 

Samuelson’s exposition is consistently mathematical, and presupposes 
a knowledge of advanced calculus, higher algebra, and differential equations 
(in appendices he summarizes some theorems from these branches and gives 
a lengthy and technical introduction to difference equations, which to my 
unmathematical mind seems much more difficult than say Jordan’s book on 
finite differences). I consider this failure to provide translations for the 
“literary” economist a serious shortcoming of his work. He dismisses trans¬ 
lations into words as “mental gymnastics of a peculiarly depraved type.” I 
disagree. There is no depravity, nor is there virtue, in telling other competent 
economists things in a language they all can understand—^there is simply 
responsibility to the canons of scholarship. 


Sampling Inspection: Prindples, Procedtires, and Tables for Single, Double, 
and Sequential Sampling in Acceptance Inspection and Quality Control Based 
on Percent Defective. By the Statistical Research Group, Columbia TJniverdty, 
Applied Mathematics Panel, Office of Scientific Research and Development. 
Edited by E, A. Freeman (Associate Professor of Statistics, Massachusetts 
Institute of Technology, Cambridge, Mass.); MiUon Friedman (Associate Pro¬ 
fessor of Economics, University of Cldcago, Chicago 37, 111.); Frederick MosteHer 
(Associate Professor of Mathematical Statistics, Department of Social Relations, 
Harvard University, Cambridge, Mass.); and TT. AUefn WaXLis (Professor of Sta- 
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tistics and Business Economics, University of Chicago, Chicago 37, Ill.). New 
York 18: McGraw-Hill Book Co., Inc. (330 West 42nd St.), 1948. Pp. xx, 395. 
$5.25. (London W.C.2: McGraw-Hill Pubhshing Co., Ltd. [Aldwych House, 
Aldwych]. 31s. 6d.) Two reviews follow: 

Review by Edwin G. Olds 

Associate Professor of Mathematics, Carnegie Institute of Technology 
Pittsburgh 13, Pennsylvania 

S AMPLING inspection is the process of deciding whether to accept or reject 
an aggregate of material on the basis of the evidence obtained by in¬ 
specting only part of it. The evidence may be in the form of percentage of 
units nonconforming or may consist of measurements for each unit inspected. 
It is customary to denote the procedure, in the first case, as acceptance 
sampling by attributes and, in the second case, as acceptance sampling by 
variables. 

Early in 1945 the Statistical Research Group, Columbia University, oper¬ 
ating under the direction of the Applied Mathematics Panel, Office of Scien¬ 
tific Research and Development, was asked by the Navy to prepare a manual 
on sampling inspection, including tables, procedures, and principles. A 
manual on acceptance sampling by attributes was prepared first. The close 
of the war prevented completion of the work then under way on acceptance 
sampling by variables. The present volume is an expansion of the prelimi¬ 
nary edition of the Navy manual. 

The book has the subtitle: Principles, Procedures, and Tables for Single, 
Double, and Sequential Sampling in Acceptance Inspection and Quality 
Control Based on Percent Defective. The last four words should be empha¬ 
sized since, in view of the main title, Sampling Inspection, the reader might 
otherwise expect to find some discussion of sampling inspection based on 
measurements. The reviewer’s main criticism of this important book might 
be that no treatment of acceptance sampling by variables has been included. 
In view of the book’s subtitle probably this is an unfair cavil and merely 
reflects the reviewer’s disappointment that this companion problem still 
lacks the same comprehensive exposition which the present volume now 
makes available for acceptance sampling by attributes. 

The three purposes of the book (as stated by the authors on p. 3) are: 
(1) “to explain the general principles of sampling inspection as a method for 
determining the quality and acceptability of material and as a method for 
controlling the quality of material”; (2) “to describe in detail a standard 
sampling-inspection procedure that may be readily applied to a wide variety 
of products”; (3) “to provide an extensive catalogue of sampling plans, usa¬ 
ble both by those who adopt the standard procedure and by those who con¬ 
struct tailor-made procedures,” With the restriction noted above, the book 
succeeds admirably in accomplishing all three purposes. It will meet the 
needs of three kinds of readers: those who want a ready-made sampling- 
inspection program, those interested in building their own programs, and 
students of acceptance sampling. 
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The book is divided into five parts. Part I consists of a chapter of intro¬ 
duction; Part II contains five chapters of discussion of the principles of sam¬ 
pling inspection for attributes; Part III, with seven chapters, gives a descrip¬ 
tion of a standard sampling-inspection procedure; Part IV has four chapters 
on the construction of sampling tables and procedures. The foundation of 
the book, as noted in the preface, is the set of sampling tables presented in 
Part V. 

As suggested in the Preface, Parts II, III, and IV have been written in such 
form that anyone needing a ready-made sampling inspection program can use 
Parts I, III, and V, omitting Parts II and IV; readers interested in building 
their own programs can substitute Part II for Part III; and, finally, those 
who want to see all of the technical details of tabular construction will find 
them in Part IV. Probably this plan of organization increases the scope of 
utilization of the book, but it requires that there be a considerable amount 
of repetition. 

Ordinarily, the surest way to avoid pleasure from reading a novel is to read 
the last chapter first and certainly it is unorthodox to review a book in that 
order. This reviewer, however, could not resist the temptation to turn to 
Part V and, after much consideration, decided this would be an appropriate 
part to describe first. 

There are four tables in Part V. Table 4, which is captioned “Sampling 
Plans and Operating-characteristic (00) Curves, Classified by Sample-size 
Letter, and by Acceptable-quality Level (AQL) and Average Outgoing- 
quality Limit (AOQL),*^ consists of 159 pages. With a few exceptions, each 
page contains a single, a double, and a group sequential sampling plan to¬ 
gether with the operating characteristic curves for each. The three plans are 
matched on the basis of their operating characteristic curves. 

On page 288, a fairly representative page, there is a single sampling plan 
specifying a sample of 75 and an acceptance number of 1; a double sampling 
specifying a first sample of 50 and a second sample of 100, with accept¬ 
ance number of 0 and rejection number of 3 for the first sample and with ac¬ 
ceptance number of 2 and rejection number of 3 for the combined sample; a 
sequential plan, having seven steps of 20 units each, with decision assured at 
the end of the seventh sample. Printed in color, in the lower part of the page, 
are the operating-characteristic curves for the three plans. 

The single and double plans, at least, are familiar since they appear in 
the Standard Sampling Inspection Tables issued by the Army Service Forces 
as part of ORD-M608-8. (This is not surprising since, in the words of the 
authors (p. vi) “the tables and procedures in this book are in large measure 
adapted from or based on Army Ordnance tables and procedures.") The 
double sampling plan, for example, is given in Table 1, Normal Lot-by-Lot 
Inspection, corresponding to an Acceptable Quality Level of .26—60% for 
major defects and sublots of size of 500 to 799. The approximate AOQL is 
recorded as 1%. In the volume under review, however, the classification is 
“Sample-size Letter; H; AQL class: 0.32-0,65; AOQL class; 0.90-1.6," The 
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classification by sample-size letter, then would seem to have replaced classi¬ 
fication on the basis of lot size. Actually the change is not so direct, or so 
easy to dismiss. 

In order to find the appropriate sampling plan in the set of Army Ordnance 
Tables, the type of sampling (single or double) and the acceptable quality 
level must be specified; the sublot size must be known; and information re¬ 
garding the level and state of control of the manufacturing process should 
be available. For any particular type of sampling, sublot size and acceptable 
quality level, three plans are available; a “normal” plan, to be used when the 
process average is near the acceptable quality level; a “reduced inspection” 
plan, when the process is in control at a level significantly better than the 
acceptable quality level; and a “more exacting” plan, when the average 
quality of inspected product is significantly worse than the acceptable quality 
level. The “reduced inspection” plan, when appropriate, requires about one- 
fifth as much sampling as the “normal” plan. The “more stringent” plan uses 
the same sample sizes as the “normal” plan, but lower acceptance numbers, 
thus causing the rejection of a greater percentage of imperfect lots. The tables 
contain the criteria upon which a decision between plans is made for each 
lot which is submitted. 

The first step in abstracting the appropriate sampling plan from the set 
of plans provided in the book under review might be to enter Table 1 of 
Part V. This table is captioned “Sample-size Letter, by Inspection Level 
and Size of Inspection Lot.” There are five “inspection levels”: I, II, III, 
IV, V. The amount of inspection at these levels is in the ratio 1:1J :2:3:4 
(approximately), A footnote states that “inspection level III will be used for 
most products for normal inspection,” and, corresponding to this level and 
lot sizes 500-800 is found sample-size letter II, which appeared on page 288 
in conjunction with “normal” Army Ordnance plans. Any of the other four 
levels, however, might be chosen as “normal,” the choice depending on eco¬ 
nomic considerations. 

In order to proceed further it is necessary to fix an Acceptable Quality 
Level. Table 2, “Summary of Sampling Plans Classified by Acceptable- 
quality Level and Sample Size Letter,” has subtables for single-sampling, 
for double-sampling and for sequential-sampling. The specific plan to be 
adopted can be obtained here or direct reference may be made to Table 4, 
where three types of plans are matched as described above. 

The reader who turns directly to the tables of Part V probably will be at 
a loss as to what inspection level to choose for “normal” inspection and, if 
he is famfiiar with other sets of sampling plans, he may wonder, perhaps, 
under what conditions there should be some change of plan in order to pro¬ 
vide for “reduced” or “tightened” inspection. The term “inspection level” 
seems to be new. As explained on pages 45-46 and elsewhere, it is set for the 
product irrespective of the size of the lots into which the product is subdi¬ 
vided for inspection purposes. The higher the inspection level for a given 
lot size, the better will be the discrimination between good and bad lots* For 
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the same acceptable quality level and lot size the plans specified for the five 
inspection levels, provide, in general, for a decrease in risk of accepting un¬ 
wanted product with an increase in inspection level. Inspection level is not 
to be confused with percentage of inspection; these tables follow Army Ord¬ 
nance in providing for some increase in discrimination (and, therefore, 
sample size) as the lot sizes increase. 

Footnotes in Table 1 direct the user to drop his inspection level two steps 
for reduced inspection, but to retain the normal level for tightened inspec¬ 
tion. It is unfortunate that Table 1 contains no reference to Part III, 
Chapter 11 (pp. 120-125) for it is here that the authors tell when and how 
to change inspection plans. The criteria and procedures are substantially 
the same as given in the Ordnance Tables. 

If rejection of lots is to be interpreted as sentencing lots to 100% inspection 
then it is important to know the Average Outgoing-quality limit for ma¬ 
terial inspected. Table 3, “Sample-size Letters for Average Outgoing-quality 
Limit (AOQL) Classes and Acceptable-quality Level (AQL) Classes,” sup¬ 
plies the required information. It may also be used to get sampling plans when 
an AOQL is specified. 

To indicate further the difference of these tables from the Army Ordnance 
Tables it seems desirable to quote the authors (p. 142). 

“The major respects in which the present collection of sampling plans 
differs from the Army Ordnance collection are as follows: (a) it includes 
sequential-sampling plans as well as single- and double-sampling plans; (6) 
it includes plans for smaller sample sizes—^for example, the single-sampling 
in the main Army Ordnance single-sampling table (Table C) have sample 
sizes varying from 75 to 1,500; in Table 2 of this book, the sample sizes vary 
from 5 to 1,500; (c) it uses a different concept of AQL; (d) all the plans are 
contained in a single table (Table 2), whereas Army Ordnance uses supple¬ 
mentary tables for reduced and tightened inspection and for inspection for 
incidental defects.” 

It would be very culpable for the reviewer to imply that this book contains 
merely a set of sampling plans together with brief instructions regarding their 
utilization. On the contrary, the book treats the subject of sampling inspec¬ 
tion by attributes in great detail. The bases for the decisions necessary in 
installing and operating acceptance sampling are discussed from almost every 
angle. The methods of constructing the sampling plans are carefully illus¬ 
trated. Various possible alternatives are examined and the reasons why they 
were discarded are given in full. The reviewer has been unable to think of 
any question regarding the tables which has not been answered somewhere 
in the book. Nor has he entertained any criticism of choice of procedures 
without being able to discover that the authors have anticipated the possi¬ 
bility of just this criticism and have prepared a strong brief in support of 
their procedures. This insistence on complete coverage makes the book a 
valuable reference, especially in view of the scarcity of general publications 
in the field. 
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The introduction to the principles of sampling inspection for attributes 
which is given by Part II is excellent. Using well-chosen examples the authors 
describe single, double and group sequential sampling; introduce the reader 
to operating characteristic, average outgoing quality, and average amount of 
inspection curves; and discuss the various factors connected with the in¬ 
stallation and operation of sampling plans. Methods of constructing con¬ 
trol charts for fraction defective and for defects per unit are well presented 
and the use of sampling plans for control sampling is discussed. Finally there 
is a brief, but pointed appendix on the relation of sampling inspection to 
design specifications. 

Part III describes how the set of tables in Part V can be used as the basis 
of an integrated inspection program. The considerations necessary in start¬ 
ing a sampling procedure are outlined and then the details of the operation 
of a sampling procedure are set down. Step-by-step the inspector is told 
what to do, how to do it, and why. Here he finds rules for drawing the 
sample, maldng his decision on the lot, calculating process average, changing 
from normal to reduced or tightened inspection, etc. This is the part of the 
book which activates the tables. It is an expert job and major excerpts will, 
no doubt, be quoted in many an inspection manual. 

The somewhat limited class of readers who are concerned with methods of 
constructing sampling tables will welcome Part IV with enthusiasm. Up to 
present time detailed knowledge of the procedures used in calculating tables 
has been restricted largely to those intimately connected with the actual 
task. Given an individual sampling plan, it is relatively simple to write the 
formulas and make the calculations necessary to determine how the plan can 
be expected to operate; given rational requirements for an individual plan, 
it is not too difficult to write the equations which, if they could be solved, 
would lead to a plan exactly meeting the specifications; but, given alterna¬ 
tive sets of desired properties, none of which can bo satisfied exactly, the 
problem of discovering which properties can be imposed on a sot of sampling 
plans at a satisfactory level of uniformity and then planning the execution of 
the necessary computations is neither small nor easy. Therefore, anyone who 
has this last problem to solve has the greatest interest in the philosophy, 
principles and techniques basic to previous solutions. The clarity and com¬ 
prehensiveness of Part IV will be appreciated. 

There are a few harmless misprints in the book, and there is an occasional 
statement with which the reviewer could not entirely agree. On page 12, for 
instance, the remark is made that ^the acceptance or rejection of an entire 
inspection lot on the basis of the quality of a sample drawn from that inspec¬ 
tion lot provides a strong incentive for the supplier to improve the quality 
of his product.” If, as often happens, the customer’s inspection is preceded 
by a final inspection by the producer, then it is the improvement of this 
final inspection for which the customer’s rejections provide a strong incen¬ 
tive. 

In an attempt at simplification the authors have decided to redefine ac- 
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ceptable-quality level (AQL) as “percentage of defective items in an inspec¬ 
tion lot such that the sampling plan will result in the acceptance of 95 per 
cent of submitted inspection lots containing that percentage of defective 
items” (p. 144). This reviewer is not alone in deploring the placing of this ad¬ 
ditional, and seemingly unnecessary, hazard in the path of the lay reader of 
statistics. 

But, in spite of his few minor criticisms, the reviewer is convinced that 
this is an outstanding book. Anyone who is at all concerned with the problem 
of acceptance sampling by attributes needs to be thoroughly conversant with 
its content. 


Review by Paul S. Olmstead 
Bell Telephone Laboratories, Inc., Murray Hill, New Jersey 

T he authors state that Sampling Inspection presents “in a form suitable 
for the peacetime needs of industry, sampling procedures for use in either 
acceptance inspection or process control.” Only two of the seventeen chap¬ 
ters deal with process control. Hence, it may be stated that, basically, the 
book is for a consumer who is unwilling to accept any evidence concerning the 
quality of a lot except that given by test of a sample of that lot. The plans 
associated with the procedures are devised to reject approximately 5% of 
lots at the Acceptable Quality Level (AQL—^this term is defined in a new 
way) and to assure delivery of product in the long rim, subject to additional 
work by the supplier, that will meet an Average Outgoing Quality limit 
(AOQL). This incidentally places a limit on the Lot Tolerance Per Cent 
Defective (LTPD). Plans for reduced inspection on the basis of spaced con¬ 
tinued satisfactory quality from one supplier aim to keep rejections at AQL 
and acceptances at LTPD at approximately the same levels as in normal 
plans. Plans for tightened inspection to protect against poor quality from a 
supplier Tdth a previous record showing that his quality is normally poorer 
than AQL keep sample sizes the same but reduce in per cent the AQL, 
AOQL and LTPD. This places a very severe penalty on lots at AQL in many 
cases, reducing the chance of acceptance from 96% to as little as 10%. No 
consideration is given to the possible alternative of having suppliers submit 
evidence of control for consecutive lots produced prior to a current lot as 
partial evidence for the satisfactory quality of the lot submitted. Under this 
scheme, test by the consumer need be only a check inspection to verify that 
the lot submitted conforms to what might be expected on the basis of such 
evidence for control. 

In addition to the sampling plans (including Operating-Characteristic 
(00) Curves), which constitute almost half of the book, an account is given 
of methods of calculation and various bases and procedures for making use of 
the plans. It is pointed out that these plans relate only to the “quality of a 
lot” expressed by “the percentage of defective items it contains,” where the 
quality of an individual item is expressed as “defective or nondefective.” 
Part I is a short chapter of an introductory nature. Part II defines the major 
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concepts used in the book and discusses the principles of sampling inspection 
for attributes taken into account in the construction of the tables. It in¬ 
cludes simple examples of single, double, and sequential sampling plans. 
Chapter 5, in particular, points out many of the difficulties associated with 
sampling inspection, dwelling at length on the vagaries of selecting a “ran¬ 
dom sample”. The answer reached in one instance associated with spottincss 
is not apt to satisfy engineers who feel that one of the objects of sampling 
inspection is to discover evidence for just such a condition. Such evidence 
is often provided by keeping track of the order in which defectives are en¬ 
countered. Order is not mentioned in this book. The use of sampling inspec¬ 
tion data to obtain evidence for control is discussed briefly in Chapter 6. 
This chapter rightly points out that many engineers may not be satisfied with 
control charts based on data for which 5% of the points are expected outside 
control limits when the process is in fact under control. This is one con¬ 
sequence of the choice of definition of AQL. Incidentally, the authors use 
the words “take corrective action” as applying to this 5% of the points rather 
than the more appropriate “look for trouble.” The engineer “takes corrective 
action” only when he finds that a criterion based on experience as associated 
with trouble is exceeded. In other cases, he may “look for trouble” but, 
ordinarily, would not find it necessary to “take corrective action.” This part 
closes with a chapter dealing with the difference between design requirements 
and acceptance requirements. 

Part III considers a “standard sampling inspection procedure.” What is 
presented is a reasonably complete prospectus, with numerous examples, 
that may be followed as a model by those who are willing to accept the under¬ 
lying principles mentioned above. Most engineers will find much of the 
matenal useful as a general guide to a larger number of things that may cause 
trouble in setting up inspection procedures. Part IV discusses the coverage 
of the tables and the methods used in computation, dealing particularly 
with shortcuts, limits of error, and approximations. Some mathematical 
statisticians may object to the use of the Poisson Exponential as an approxi¬ 
mation to the Hypergeometric Series. Considerable space is devoted to 
showing how to compute the average amount of inspection (ASN—^Average 
Sample Number) required to reach a decision to accept or reject for each of 
the three types of sampling. Readers should be careful not to confuse this 
average amount of inspection with the average amount of inspection re¬ 
quired to give assurance that the quality of product meets the AOQL. The 
latter takes into account the detailing of rejected lots and therefore may 
differ but little for the three types of sampling. Comparative values of ASN 
are given for only a few of the sampling plans. 

Part V is devoted to presentation of the tables. These, in most cases, show 
dngle, double and limited step curtailed sequential sampling plans for ranges 
in lot size (cross referenced to a sample size letter) and ranges in AQL and 
AOQL. On each page along with a table showing three sampling plans is a 
chart with the 00 curves for these plans. These curves, being in color, are 
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subject to printing errors as explained on page x. The book also contains a 
consolidated list of references, a glossary, and an index. The relatively few 
errors, mainly associated with printing, should not cause the reader diffi¬ 
culty. 

It is quite likely that many engineers will find the sampling tables useful 
in their work. The underl 3 dng philosophy associated with their use, however, 
may lead to a somewhat different ^standard sampling inspection procedure” 
—one that would place more emphasis on process control and less on ac¬ 
ceptance inspection than the one outlined in Part III of the book. Conse¬ 
quently, this reviewer would classify Sampling Inspection as a good reference 
book but not a textbook. 

How to Chart: Facts From Figures With Graphs. Walter E. Weld (Investment 
Adviser, 227 Bawson Hoad, Brookline, Mass.). Preface by A. C. Haskell (Codex 
Book Co., Inc., Norwood, Mass.). Norwood, Mass.: Codex Book Co., Inc. (74 
Broadway), 1947. Pp. xiv, 218. $3.00. 

Hbvibw by Kenneth W. Haembr 
Statistician, American Telephone and Telegraph Company 
195 Broadway, New York 7, New York 

T his is a book for the businessman who knows only a little about charts or 
about how to make them. On the other hand, several of the seventeen 
chapters discuss chart uses and methods that even the experienced chartist— 
businessman or not—might employ with profit. 

Item number one on the credit side is the author’s style of writing. It is 
easy to read, free of unnecessary technical terms, and effective in conveying 
the author’s ideas. It can be summarized as conversational rather than 
academic. 

The illustrations are excellent, plentiful, and well drawn; they accurately 
illustrate the point under discussion, a feature more often absent than present 
in books about charts. It is a sad paradox that so many books on graphics 
are so poorly illustrated: Mr. Weld is to be congratulated on refusing to 
follow this “do as I say—don’t do as I do” line of least resistance. 

Throughout the book, the charts are explained in terms of specific business 
applications. Although this approach has the mild disadvantage of seeming 
to restrict the chart to only one t 3 rpe of use or application, it does succeed 
in bringing the charts to life and in pointing up exactly how each fits a par¬ 
ticular type of business activity. It should be an effective means of introduc¬ 
ing charts to any businessman who is just beginning to discover the extent of 
their usefulness. 

The principles of chart presentation are skillfully explained. It is perhaps 
regrettable that Mr. Weld does not accept the principle that simple bar 
comparisons (1-scale measurement) are generally best shown as horizontal 
bars, and that the vertical arrangement is best reserved for 2-scale bar pres¬ 
entation. Also, in his discussion of multiple-vertical-scale presentation he 
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has mentioned only the advantages and has neglected to point out the con¬ 
siderable disadvantages. In almost all other respects, however, the graphic 
principles set forth are beyond reproach. 

Considerable space is given to instruction on how to prepare charts. Al¬ 
though these instructions naturally reflect the author's own experience, and 
occasionally ignore the fact that there are alternatives, they should be ex¬ 
tremely helpful to the beginning chart maker. Some of these practical hints 
are especially valuable because they so seldom find their way into print. For 
example, Mr. Weld points out that since the color light blue does not photo¬ 
graph, lines and other markings that are not to appear on the reproduced 
copies should be made on the original with a light blue pencil. This may seem 
to be a small point, but it is one that saves troubles and dollars, and one that 
incidentally demonstrates the author's practical point of view. 

This book focuses attention on three types of charts that in many other 
books are discussed only briefly or are not mentioned at all. Each of them is 
a type developed from a special kind of scale: of the three, the semi-logarith¬ 
mic chart properly receives the most emphasis. However, the other two, 
probability scale and reciprocal scale charts, are discussed at sufficient length 
to explain how and when they can be used to advantage. 

The first of the two chapters on the semi-logarithmic chart provides an 
excellent introduction to the mechanics of this form of presentation. Of 
course, it is possible to quibble with some of the things the author has to say 
about the semi-logarithmic chart: it is a subject that can be quibbled about 
with very little effort. For example, he says "... it is often desirable and 
extremely helpful to know the rate of change, or percentage of gain or loss. 
The kind of chart which gives this information is called a ratio (or semi- 
logarithmic) chart.” This reviewer holds to the belief that a semi-log chart 
shows only relative rate of change. But the preciseness of the term is indeed 
a minor matter and does not detract from the general soundness of the ex¬ 
position. In Chapter 7, which discusses the uses of semi-log charts, Mr. Weld 
not only explains when the type of presentation is appropriate, but re¬ 
peatedly points out that it should not be used with the mistaken idea that it 
can simultaneously serve the purpose of an arithmetic chart. This one point 
alone, if thoroughly absorbed by the reader, would be worth the price of the 
entire book. 

The author's discussion of the probability scale and its uses in business 
charting (Chap. 8) is easy to read and is informative. Here, as elsewhere 
throughout the book, the statistical propriety of the applications cited, or 
the conclusions concerning these uses, are not considered to be within the 
scope of this review: Judged solely as an explanation of its function as a tool 
of graphic analysis, this chapter is a good introduction to this type of chart. 
Although the reciprocal chart is primarily a means of computation rather 
than of presentation, Mr. Weld makes a good case for devoting a chapter to 
it (Chap. 9). His explanation of its wide usefulness should encourage a more 
extensive use of this little-known chart. 
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In this reviewer's opinion, two other chapters deserve special mention. 
Chapter 15, “Charts of Various Bands," is a quick review of assorted chart 
varieties—from progress charts to trilinear charts—^which the author feels 
warrant inclusion. This chapter provides a useful, if limited, supplementary 
“idea" file. Chapter 16, “Exhibits Good and Bad,” is a chart clinic in which 
seventeen selected examples are subjected to critical examination. These 
should be two of the most valuable chapters in the book for the novice in 
chart presentation. 

Since it is inevitable that author and reviewer seldom agree on all things, 
some chips must fall on the unfavorable side. 

To this reviewer, the emphasis in the book seems faulty. It appears to be 
designed to focus attention on the major interests of the author rather than 
on the probable interests of the reader. Because of the author's very special 
field of interest, these two are unlikely to be the same. 

For example, stock market analysis charting and business forecasting 
fill three chapters and parts of others. This is two chapters more than is 
allowed to any other subject except semi-logarithmic charts, and seems out 
of proportion in an introductory discussion to business charting. Indeed, 
this ardent exposition seems appropriate for only one group of businessmen 
—^investment analysts—among which number the author presumably is one. 
Although other businessmen readers will no doubt be intrigued to learn that 
there are stock market patterns known as “The Head & Shoulders,” and 
“The Rounding Bottom,” to mention two of many, few are likely to feel an 
overwhelming desire to put this special knowledge into use. 

On the other hand, the subject of map presentation is covered with one il¬ 
lustration and a few sentences. So fragmentary a treatment of so important a 
subject seems scarcely worth including. 

In several spots, the sequence seems unnatural. Stock market charting 
(Chaps. 10-11) and Business Forecasting (Chap. 13), are separated by a 
chapter on “What Should Be Charted to Help the Business Manager.” 
Chapter 14, “Making and Reproducing Graphs,” is sandwiched between 
“Business Forecasting” and “CWts of Various Bands.” 

There is a considerable amount of overlapping. This need not be a fault if 
it is carefully planned with a legitimate purpose, but in this case much of it 
has the appearance of being partly incidental and partly accidental. For 
example, one of the subheads under Chapter 17 is “Procuring Suitable 
Chart Sheets.” Under Chapter 14 there is a subhead “Selection of Suitable 
Chart Paper.” In fact, the Codex Company or its products are mentioned one 
or more times on 18 pages throughout the book. It is only fair to point out 
that several of these mentions include invitations to call on the Codex Com¬ 
pany for further advice or guidance. Nevertheless, this reader found it dis¬ 
turbing to have the publisher's name thrust forward with such artless 
frequency. Another example of overlapping: One of the items covered in 
Chapter 4 is the matter of choosing lines or spaces to represent units of time. 
Although no mention is made in the chapter heading, Chapter 5 also discusses 
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this problem at some length. There is, of course, some advantage in repeti¬ 
tion at strategic points, but this reader for one prefers to find most of the 
information on a given subject set down on adjacent pages. 

Although it makes for ease of reading and a fast moving exposition, the 
author’s penchant for singling out one device or one method and neglecting 
to mention the alternatives, does a disservice to the reader who wants to 
know the best way rather than just a good way. For example, when discuss¬ 
ing lettering guides, Mr. Weld mentions only Wrico lettering guides. In spite 
of the fact that this reviewer leans toward Wrico guides, it would seem 
desirable to tell the reader at least what others exist, and in order to be of 
maximum help to explain the advantages and disadvantages of each. 

This reviewer is of the opinion that no one book about charts can hope to 
answer all chart questions. The design and use of charts involves so many 
decisions based on judgment and on knowledge outside the chart itself, 
that any book, however extensive, must leave many questions unanswered. 
The author of a book on charts is confronted with three major choices: a 
thorough job on principles, applying to all fields of charting; a thorough job 
on the detailed procedures for a limited field of charting; or a compromise 
combining some of each. In his present book, Mr. Weld has chosen to follow 
the latter course. He has succeeded very well in presenting the principles of 
simple charting, spiced with enough specific applications to make them ap¬ 
petizing and easy to digest. Although it is directed primarily to businessmen, 
this book has something to say also to nonbusiness chart makers. 

In addition to providing a helpful introduction to the beginner, this book 
fits neatly onto the bookshelf, next to more voluminous works, where it 
should serve as a compact reference to the author’s special field of interest. 


NOTE: As a result of the reduction in the number of pages in 
this issue of the Journal, the Statistical Methodology Index 
is being omitted. Headers are invited to write to the Secre¬ 
tary’s office concerning the desirability of continuing this 
series of bibliographies of the literature on statistical metho¬ 
dology. 
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